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Summary of the Invention 
The present invention is based, at least m part, 
on the discovery of cDNA molecules encoding TANGO 191 and 
TANGO 195, both of which are transmembrane proteins. 
5 These proteins, fragments, derivatives, and variants 
thereof are collectively referred to as "polypeptides of 
the invention" or "proteins of the invention." Nucleic 
acid molecules encoding polypeptides of the invention are 
collectively referred to as "nucleic acids of the 
10 invention. " 

The nucleic acids and polypeptides of the present 
invention are useful as modulating agents in regulating a 
variety of cellular processes. Accordingly, in one 
aspect, this invention provides isolated nucleic acid 
15 molecules encoding a polypeptide of the invention or a 
biologically active portion thereof. The present 
invention also provides nucleic acid molecules which are 
suitable as primers or hybridization probes for the 
detection of nucleic acids encoding a polypeptide of the 
20 invention. 

The invention features nucleic acid molecules 
which are at least about 45% (or 55%, 65%, 75%, 85%, 95%, 
or 98%) identical to the nucleotide sequence shown in SEC 

ID NO: 1, 3; 4, 6 or , or the nucleotide sequence of 

25 the cDNA insert of either the clone deposited with the 
American Type Culture Collection, Manassas, VA (ATCC) as 

Accession Number or the clone deposited with the 

ATCC as Accession Number {the "cDNA of ATCC " 

or the "CDNA of ATCC "), or a complement thereof. 

The invention features nucleic acid molecules 
which include a fragment of at least 30C (325, 350, 375, 
400, 425, 450, 500, 550, 600, 650, 700, 800, 90C , 1000, 
or 1200) nucleotides of the nucleotide sequence of SEQ 
ID N0:1, 3, 4, 6 or or the nucleotiae sequence of the 



30 
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NOVEL SECRETED IMMUNOMODULATORY PROTEINS 
AND USES THEREOF 



5 



p^^rkaroun ri of the Invention 
Many secreted proteins, for example, cytokines and 
cytokine receptors, play a vital role in the regulation 
of cell growth, cell differentiation, and a variety of 
specific cellular responses. A number of medically 
15 useful proteins, including erythropoietin, granulocyte- 
macrophage colony stimulating factor, human growth 
hormone, and various interleukins , are secreted proteins. 
Thus, an important goal in the design and development of 
new therapies is the identification and characterization 
20 of secreted proteins and the genes which encode them. 

Many secreted proteins are receptors which bind a 
ligand and transduce an intracellular signal, leading to 
a variety of cellular responses. The identification and 
characterization of such a receptor enables one to 
25 identify both the ligands which bind to the receptor and 
the intracellular molecules and signal transduction 
oathways associated with the receptor, permitting one to 
identify or design modulators of receptor activity, e.g., 
receptor agonists or antagonists and modulators of signal 
30 transduction. 
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Also within the invention are isolated 
polypeptides or proteins which are encoded by a nucleic 
acid molecule having a nucleotide sequence that is at 
least about 65%, preferably 75%, 85%, or 95% identical to 
a nucleic acid sequence encoding SEQ ID NO: 2, 5 or 
and isolated polypeptides or proteins which are enc^d 
by a nucleic acid molecule having a nucleotide sequence 
which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 

of SEQ ID NO: 1, 3, 4, 6 or a complement thereof or 

the non- coding strand of the cDNA of ATCC or the 

cDNA of ATCC . 

Also within the invention are polypeptides which 

are a naturally occurring allelic variants of a 

polypeptide that includes the amino acid sequence of SEQ 

ID N0:2, 5 or an amino acid sequence encoded by the 

CDNA of ATCC , or an amino acid sequence encoded by 

the CDNA of ATCC , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 

nucleic acid molecule having the sequence of SEQ ID NO: 

^' ^' ^ °^ a complement thereof under stringen- 

conditions . 

Tr.e invention also features nucleic acid moiecuiet- 
thac hybridize under stringent conditions to a nucleic 
acid molecule having the nucleotide sequence of SEQ ID 

N0:1, 3, 4, or 6, the cDNA of ATCC or the cDNA of 

ATCC , or a complement thereof. In other 

embodiments, the nucleic acid molecules are at least 300 
(325. 35C, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
80C. 90C, 1000, or 1290) nucleotides m length and 
hybridizes under stringent conditions to a nucleic acid 
molecule having the nucleotide sequence of SEQ ID N0:1, 

^- 5/ or the cDNA ATCC , or the cDNA of ATCC 

, cr a complement thereof. 
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cDNA of ATCC or the cDNA of ATCC , or a 

complement thereof. 

The invention also features nucleic acid molecules 
which include a nucleotide sequence encoding a protein 
5 having an amino acid sequence that is at least about 45% 
{or 55%; 65%, 75%, 85%, 95%, or 98%) identical to the 
amino acid sequence of SEQ ID NO: 2, 5 or _ or the amino 

acid sequence encoded by the cDNA of ATCC or the 

cDNA of ATCC , or a complement thereof. 

In preferred embodiments, the nucleic acid 
molecules have the nucleotide sequence of SEQ ID NO: 1, 

3^ 4^ 6 or or the nucleotide sequence of the cDNA of 

;^TCC / or the cDNA of ATCC . 

Also within the invention are nucleic acid 
15 molecules which encode a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2 or 5 the fragment 
including at least 15 (25, 30, 50, 100, 150, 300, or 400) 
contiguous amino acids of SEQ ID NO: 2 or 5, the 

polypeptide encoded by the cDNA of ATCC , or the 

20 polypeptide encoded by the cDNA of ATCC . 

The invention includes nucleic acid molecules 
which encode a naturally occurring allelic variant of a 
polypeptide comprising the ammo acid sequence of SEQ ID 

NO: 2, 5 or the amino acid sequence encoded by the 

2 5 cDNA of ATCC , or the amino acid sequence encoded by 

the cDNA of ATCC , wherein the nucleic acid molecule 

hybridizes to a nucleic acid molecule having a nucleic 

acid sequence encoding SEQ ID NO: 2, 5 or or a 

complement thereof under stringent conditions. 

Also within the invention are isolated 
polypeptides or proteins having an amino acid sequence 
that is az least about 65% preferably 75%, 85%, 95%, or 
98% identical co the amino acid sequence of SEQ ID NO: 2, 
5 or 



30 



wo 00/18800 



PCT/US99/22818 



- 6 - 

polypeptide; (2) the ability to bind a iigand of the 
naturally-occurring polypeptide; (3) the ability to bind 
to an intracellular target of the naturally-occurring 
polypeptide. Other activities include: (i) the ability 
5 to modulate cellular proliferation; (2) the ability to 
modulate cellular differentiation; and (3) the ability to 
modulate cell death. 

In one embodiment, a polypeptide of the invention 
has an amino acid sequence sufficiently identical to at 

10 least one domain of a polypeptide of the invention. As 
used herein, the term "sufficiently identical" refers to 
a first amino acid or nucleotide sequence which contains 
a sufficient or minimum number of identical or equivalent 
(e.g., with a similar side chain) amino acid residues or 

15 nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide 
sequences have a common structural domain and/or common 
fiinctionai activity. For example, amino acid or 
nucleotide sequences which contain a common structural 

20 domain having about 65% identity, preferably 75% 

identity, more preferably 85%, 95%, or 98% identity are 
defined herein as sufficiently identical. 

In one embodiment, the isolated polypeptide lacks 
both a transmembrane and a cytoplasmic domain. In 

2 5 another embodiment the polypeptide lacks both a 

transmembrane domain and a cytoplasmic domain and is 
soluble under physiological conditions. 

The polypeptides of the present invention, or 
biologically active portions thereof, can be operably 

3 0 linked to a heterologous amino acid sequence to form a 

fusion protein. The invention further features 
antibodies that specifically bind a polypeptide of the 
invention such as monoclonal or polyclonal antibodies. 
In addition, the polypeptides of the invention or 
3 5 biologically active portions thereof can be incorporated 
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In preferred embodiments, the isolated nucleic 
acid molecules encode a cytoplasmic (SEQ ID N0:11 or 16), 
transmembrane (SEQ ID NO:10 or 15), or extracellular (SEQ 
ID NO: 9 or 14) domain of a polypeptide of the invention 
5 or a complement thereof. In another embodiment, the 
invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a nucleic acid 
of the invention. 

Another aspect of the invention provides vectors, 
10 e.g., recombinant expression vectors, comprising a 
nucleic acid molecule of the invention. In another 
embodiment, the invention provides host cells containing 
such a vector. The invention also provides methods for 
producing a polypeptide of the invention by culturing, m 
15 a suitable medium, a host cell of the invention 

containing a recombinant expression vector such that a 
the polypeptide is produced. 

Another aspect of this invention features isolated 
or recombinant proteins and polypeptides of the 
20 invention. Preferred proteins and polypeptides possess 
at least one biological activity possessed by the 
corresponding naturally-occurring human polypep-ide. An 
activity, a biological activity, and a functional 
activity of a polypeptide of the invention refers to an 
25 activity exerted by a protein, polypeptide or nucleic 
acid molecule of the invention on, for example, a 
responsive cell as determined in vivo, or in vitro, 
according to standard techniques. Such activities can be 
a direct activity, such as an association with or an 
30 enzymatic activity on a second protein or an indirect 

activity, such as a cellular signaling activity mediated 
by interaction of the protein with a second protein. 
Thus, such activities include, e.g., (1; the ability to 
form prctein:protein interactions with proteins in the 
3 5 signaling pathway of the naturally-occurring 
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nucleic acid of the invention. m other embodiments, the 
modulator is a peptide, pepuidomimecic, or other small 
molecule . 

The present invention also provides diagnostic 
5 assays for identifying the presence or absence of a 

genetic lesion or mutation characterized by at least one 
of: (i) aberrant modification or mutation of a gene 
encoding a polypeptide of the invention, (ii) mis- 
regulation of a gene encoding a polypeptide of the 
10 invention, and (iii) aberrant post-translational 

modification of a protein of the invention wherein a 
wild- type form of the gene encodes a protein having the 
activity of the protein of the invention. 

In another aspect, the invention provides a 
15 method fcr identifying a compound that binds to or 

modulates the activity of a polypeptide of the invention. 
In general, such methods entail measuring a biolocrical 
activity of the polypeptide in the presence and absence 
of a test compound and identifying those compounds which 
20 alter the activity of the polypeptide. 

The invention also features methods for 
identifying a compound which modulates the expression of 
a polypeptide or nucleic acid of the invention by 
measuring the expression of the polypeptide or nucleic 
2 5 acid in the presence and absence of the compound. 

Other features and advantages of the invention 
will be apparent from the following detailed description 
and claims. 



Brief De scription of the Drawings 
Figure 1 depiccs the cDNA sequence (SEQ ID N0:1) 
and predicted amino acid sequence (SEQ ID NO: 2) of human 
TANGO ISl. The open reading frame of SEQ ID N0:1 extends 
from nucleotide 557 to 2353, inclusive (SEQ ID N0:3). 
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into pharmaceutical compositions, which optionally 
include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides 
methods for detecting the presence of the activity or 
5 expression of a polypeptide of the invention in a 
biological sample by contacting the biological sample 
with an agent capable of detecting an indicator of 
activity such that the presence of activity is detected 
in the biological sample, 
10 In another aspect, the invention provides methods 

for modulating activity of a polypeptide of the invention 
comprising contacting a cell with an agent that modulates 
(inhibits or stimulates) the activity or expression of a 
polypeptide of the invention such that activity or 
15 expression in the cell is modulated. In one embodiment, 
the agent is an antibody that specifically binds to a 
polypeptide of the invention. 

In another embodiment, the agent modulates 
expression of a polypeptide of the invention by 
20 modulating transcription, splicing, or translation of an 
mRNA encoding a polypeptide of the invention. In yet 
another embodiment, the agent is a nucleic acid molecule 
having a nucleotide sequence that is antisense zo the 
coding strand of an mRNA encoding a polypeptide of the 

2 5 invention. 

The present invention also provides methods for 
treating a subject having a disorder characterized by 
aberrant activity of a polypeptide of the invention or 
aberrant expression of a nucleic acid or polypeptide of 

3 0 the invention by administering an agent which is a 

modulator of the activity of a polypeptide of tne 
invention or a modulator of the expression of a nucleic 
acid or polypeptide of the invention to the subiect. In 
one embodiment, the modulator is a protein of the 
3 5 invent icr:. In another embodiment, the modulator is a 
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Detailed Description of the Invention 
The present invention is based on the discovery of 
cDNA molecules encoding TANGO 191 and TANGO 195, both of 
which are secreted proteins. The subsections and Tables 
5 summarize certain features of TANGO 191 and TANGO 195. 

TANGO 191 

The human TANGO 191 cDNA of SEQ ID N0:1 has a 1797 
nucleotide open reading frame (SEQ ID N0:3) encoding a 
599 amino acid protein (SEQ ID NO: 2) . The cDNA and 

10 protein sequences of human TANGO 191 are shown in Figure 
1. This cDNA was isolated from a human mixed lymphocyte 
reaction library based on its sequence similarity to 
genes encoding certain members of the interleukin- 1 (IL- 
1) receptor superfamily. 

15 Human TANGO 191 is a transmembrane protein having 

■ a 19 amino acid signal sequence (amino acids 1 - 19 of 
SEQ ID NO: 2; SEQ ID NO: 7) followed by a 58 0 amino acid 
mature protein (amino acids 20 - 599 of SEQ ID N0:2; SEQ 
ID N0:8). Mature TANGO 191 is predicted to have a 

20 transmembrane domain that extends from amino acid 3 58 to 
amino acid 382 of SEQ ID N0;2 (SEQ ID N0:10), an 
extracellular domain that extends from amino acid 2 0 to 
amino acid 357 of SEQ ID N0:2 (SEQ ID N0:9), and a 
cytoplamic domain extending from amino acid 3 83 to amino 

25 acid 599 of SEQ ID N0:2 (SEQ ID NO: 11) . 

TANGO 191 has a molecular weight of 68.3 kDa prior 
to cleavage of its signal peptide and a molecular weight 
of 66.1 kDa after cleavage of its signal peptide. 

TANGO 191 has four potential N-glycosylation sites 

30 (amino acids 21-24, 119-122, 152-155, and 345-248 of SEQ 
ID NO: 2); 15 potential protein kinase C phosphorylation 
sites (ammo acids 26-28, 35-37, 63-65, 160-162, 203-205, 
233-235, 272-275, 307- 309, 311-313, 327-329, 474-476, 
506-508, 538-540, 575-577, and 590-592 of SEQ ID N0:2); 
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Figure 2 is a hydropathy plot of TANGO 191. 
Relative hydrophobicity is shown above the line marked 
"0", and relative hydrophilicity is shown below the line 
marked " C " . 

Figure 3 depicts the cDNA sequence (SEQ ID NO: 4) 
and predicted amino acid sequence (SEQ ID NO: 5) of a 
partial human TANGO 195 clone. The open reading frame of 
SEQ ID NO: 4 extends from nucleotide 166 to 1101, 
inclusive (SEQ ID NO: 6) . 

Figure 4 is a hydropathy plot of TANGO 195. 
Relative hydrophobicity is shown above the line marked 
"0", and relative hydrophilicity is shown below the line 
marked "0". 

Figure 5 depicts an alignment of the amino acid 
sequences of human TANGO 195 (SEQ ID NO: 5) and human SLAK 
(SEQ ID KO:20). In this alignment the sequences are 
22,8% identical overall. 

Figure 6 depicts an alignment of portions of TANGO 
191 with PF00047, an IG superfamily domain HMM (SEQ ID 
NOs :21, 22, and 23) . 

Figure 7 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ; of 

murine TPI^GO 195. The open reading frame of SEQ ID 

NO: extends from nucleotide 42 to 376, inclusive (SEQ 

ID NO: ) . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: / 

and predicted amino acid sequence (SEQ ID NO: ) of a 

partial human TANGO 195 clone (T195Athpb93f 1) . The open 
reading frame extends from nucleotide 159 to 1118, 
inclusive (SEQ ID NO: ). 

Figure 9 depiccs the cDNA sequence (SEQ ID NO: 

and predicted amino acid sequence (SEQ ID NO: ; of a 

full length TANGO 195 clone {T195AthLal70f 10) . The open 
reading frame extends from nucleotide 2 5 to nucleotide 
879, inclusive (SEQ ID NO: ). 
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which engages a protein complex that acts to activate NK- 
yB. Members of the NK-yB family regulate many of immune 
and inflammatory genes that are induced by IL-1. 

Since TANGO 191 has some similarity to IL-1 
5 receptor, TANGO 191 nucleic acids and polypeptides as 
well as modulators of TANGO 191 expression or activity 
are useful in the treatment of a variety of immune and 
inflammatory disorders, e.g., asthma, graft vs-host 
disease, rheumatoid arthritis, psoriasis, inflammatory 
10 bowel disease, septic shock, ulcerative colitis, Crohn's 
disease, chronic myelogenous leukemia, cancer, liver 
disease, Hodgkin's disease, osteoarthritis, Lyme' disease, 
cachexia, and autoimmune diseases, e.g., myasthenia 
gravis, autoimmune diabetes, and lupus. 

15 TANGO 195 

The human TANGO 195 partial cDNA of SEQ ID NO: 4 
has a 93 6 nucleotide open reading frame (SEQ ID NO: 6) 
encoding a 312 amino acid protein (SEQ ID NO: 5) . The 
cDNA and protein . sequences of human TANGO 195 clone are 

20 shown in Figure 3. This partial TANGO 195 cDNA clone was 
isolated from a human mixed lymphocyte reaction library 
based on its homology to signalling lymphocyte activation 
marker (Cocks et al . (1995) Nature 376:260-63). Apparen- 
full-iength clones (3.0 kb and 1.3 kb) were isolated from 

2 5 the same library and a human mid- term placental library. 

The portion of human TANGO 195 encoded by the cDNA 
of SEQ ID NO: 4 is a transmembrane secreted protein having 
a 22 amino acid signal sequence (amino acids i - 22 of 
SEQ ID NO: 5; SEQ ID NO: 12) followed by a 2 91 amino acid 

30 mature protein (amino acids 23 - 312 of SEQ ID NO: 5; SEQ 
ID NO: 13 ; . The portion of mature TANGO 195 encoded by 
the cDNA of SEQ ID NO: 4 is predicted to have a 
transmembrane domain that extends from amino acid 234 to 
amine acid 254 of SEQ ID N0:5 (SEQ ID N0:I5), an 
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12 potential casein kinase II phosphorylation sites 
(amino acids 36-39, 89-92, 133-136, 224-227, 294-297, 
301-304, 311-314, 327-330, 401-404, 427-430, 490-493, and 
585-588 of SEQ ID N0:2) ; one potential tyrosine kinase 
5 phosphorylation site (amino acids 205-212 of SEQ ID 
N0:2); and six potential N-myristoylat ion sites (amino 
acids 117-122, 168-173, 217-222, 366-371, 460-465, and 
583-588 of SEQ ID N0:2) . 

Figure 2 is a hydropathy plot of TANGO 191. 
10 Relative hydrophobicity is shown above the line marked 
"0", and relative hydrophilicity is shown below the line 
marked " C " . 

Northern analysis of human TANGO 191 mRNA 
expression revealed that it is expressed in spleen, lymph 
15 node, peripheral blood lymphocytes, and bone marrow, 

A clone {EPftX191a) containing a cDNA encoding 
TANGO 191 inserted into pZL-1 (GIBCO/BRL; Bethesda, MD) 
between the NotI and Sail sites was deposited with the 
American Type Culture Collection, Manassas, VA on 

20 September 9, 1998, and assigned Accession Number . 

Human TANGO 191 appears to be a member of the IL-1 
receptor superfamily. TANGO 191 includes three regions 
(amino acida 71-123 of SEQ ID N0:2; SEQ ID N0:17;; amino 
aicds 166-223 of SEQ ID N0:2; SEQ ID N0:18); amino acids 
25 266-339 of SEQ ID N0:2; SEQ ID NO: 19) which have homology 
to the IG superfamily domain (PF00047) that is 
characteristic of members of the IL-1 superfamily (Figure 
6) . 

IL-1 receptor (IL-IR) plays a critical role the 
30 regulation of immune and inflammatory responses. 

Signalling by IL-IR requires that IL-IR form a complex 
with IL-lAcP, a protein which may be required for 
internalization of IL-IR. It is thought that both IL-IR 
and IL-lAcP interact with IRAK- 2. It has been proposed 
35 that this multiprocein complex interacts with TRAF6, 
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full length TANGO 195 clone {T195AthLal70f 10 ) . The open 
reading frame extends from nucleotide 2 5 to nucleotide 

879, inclusive (SEQ ID NO: ). 

The full-length TANGO 195 protein of Figure 9 is 
5 predicted to be a transmembrane protein having a 22 amino 
acid signal sequence (amino acids 1-22 of SEQ ID NO: 

SEQ ID NO: ) followed by a 2 63 amino acid mature 

protein (amino acids 23-285 of SEQ ID NO: ) . This form 

of TANGO 195 is predicted to have a transmembrane domain 

10 extending from amino acid 234 to amino acid 254 of SEQ ID 

NO: (SEQ ID NO: ) , an extracellular domain extending 

from amino acid 23 to amino acid 233 of SEQ IQ NO: 

(SEQ ID NO: ) and a cytoplasmic domain that extends 

from ammo acid 255 to amino acid 2 85 of SEQ ID NO: 

15 (SEQ ID NO: ) . 

In situ expression analysis of TANGO 195 in adult 
mice revealed expression in the spleen (mutlifocal 
expression with expression highest in follicles) , thymus 
(multifocal expression) , and lymph node (multifocal 

20 expression) . No expression was detected in lung and 
stomach. In situ expression analysis was also used to 
examine expression in the spleens of adult mice 1, 3, 5, 
and 14 pose -immunization with EFA/PBS. In each case 
multifocal expression was observed with expression being 

25 highest in the follicles. The expression at 14 days 
post - immunization was somewhat lower than in at other 
time points. 

Northern analysis of TANGO 195 expression revealed 
the presence of a 1.8 kb transcript and a 3 . 4 kb 

30 transcript in spleen, lymph node and thymus and a 1.8 kb 
transcript was observed in bone marrow with expression 
being highest in lymph node. Additional Northern 
analysis revealed expression in the following tissues (in 
decreasing order of expression; : lymph node, stomach, 

3 5 small intestine, appendix, lung, spleen, and bone marrow. 
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extracellular domain that extends from amino acid 23 to 
amino acid 233 of SEQ ID N0:5 (SEQ ID NC:14), and a 
cytoplamic domain extending from amino acid 255 to amino 
acid 312 of SEQ ID N0:5 (SEQ ID M0:16). 
5 TANGO 195 is a type I transmembrane protein 

belonging to the CD2 subgroup of the immunoglobulin 
superf amily . 

The TANGO 195 of SEQ ID NO: 5 has three potential 
N-glycosylation sites (amino acids 85-88, 100-103, and 

10 156-159 of SEQ ID N0:5); three potential protein kinase C 
phosphorylation sites (amino acids 163-165, 230-232, and 
308-310 of SEQ ID N0:5); three potential casein. kinase II 
phosphorylation sites (amino acids 168-171, 215-218, and 
230-233 of SEQ ID NO: 5) ; one potential tyrosine kinase 

15 phosphorylation site (amino acids 65-72 of SEQ ID N0:5); 
one potential cGMP- dependent protein kinase 
phosphorylation site (amino acids 165-168 of SEQ ID 
N0:5;; and three potential N-myristoylation sites (amino 
acids 66-71, 110-115, and 183-188 of SEQ ID N0:5). 

20 Figure 4 is a hydropathy plot of the TANGO 195 of 

SEQ ID NO: 5. Relative hydrophobicity is shown above the 
line marked "0", and relative hydrophilicity is shown 
belov; the line marked "0". 

Figure 7 depicrs the cDNA sequence (SEQ ID NO: ; 

2 5 and predicted amino acid sequence (SEQ ID NO: ) of 

murine TANGO 195. The open reading frame of SEQ ID 

NO: extends from nucleotide 42 to 876, inclusive (SEQ 

ID NO: ) . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: ) 

3 0 and predicted amino acid sequence (SEQ ID NO: ) of a 

partial human TANGO 195 clone (T195Athpb93f 1 ) . The open 
reading frame° extends from nucleotide 159 to lllS, 

inclusive (SEQ ID NO: ) . 

Figure 9 depicts the cDNA sequence (SEQ ID NO: ) 

3 5 and predicted amino acid sequence (SEQ ID NO: ) of a 
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hours) . The strongest expression was seen in monocytes 
activated with gIFN (0.2 or 2ug/ml) giving 2 bands of 
approximately 1.5-2 kb and 3-3.5 kb (with the upper band 
stronger) . Addition of LPS did not increase expression 
5 in monocytes or PBMCs and may in fact lead to a slight 
decrease in expression in gIFN stimulated monocytes. 

TANGO 195 function was investigated by 
reconstituting irradiated mice with bone marrow cells 
infected with retrovirus expressing either full length 

10 murine TANG0195 ("T195fl") or the extracellular domain of 
murine TANGO 195 ("T195ex"). The donor and recipient 
mice were C57BL/6 and congenic for CD45 (CD45.1 for 
donor, CD45.2 for recipient). Recipient mice were 
analyzed approximately 10 and 14 weeks after 

15 transplantation for blood chemistry, hematology and 

tissue histology. Peripheral blood, spleen, lymph node 
and thymus cells were analyzed by FACs analysis and TANGO 
195 RNA levels were analyzed in spleen. The percentage of 
infected cells, based on the percentage of G418 -resistant 

20 donor cells, was 54% (T195fl) and 69% (T195ex) . The level 
of TANGO 195 RNA expression in the spleen of recipient 
mice, based upon slot blot analysis (GAPDH control) , was 
10 cimes chat of control mice (328% of GAPDH for T195ex 
and 232% of GAPDH for T195fl compared co 33% of GAPDH m 

25 control mice) . 

Expression of T195ex led to a statistically 
significant increase in triglyceride levels (ave = 97.6 
mg/dl at 12 weeks; ave =94.3 mg/dl at 16 weeks) in serum 
as compared to control mice (ave =52.5 mg/dl at 12 week; 

30 ave = 62.1 mg/dl 16 weeks) or mice expressing T195fl (ave 
= 47.9 mg/dl at 12 weeks; ave = 67.5 mg/dl at 16 weeks). 

Expression of T195fl had a variety of effects on 
lymphocytes. In T195fl expressing mice, FACs analysis of 
peripheral blood showed an increase in total B220hi 
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However, a probe based on the open reading frame did not 
detect expression in lung or stomach. 

Additional Northern analysis revealed that TANGO 
195 is expressed in activated human monocytes/macrophages 
and, at lower level, in activated human lymphocytes. 
This analysis also revealed that cytokine induced 
differentiation of T cells appears to regulate TANGO 195 
expression. To carry out this analysis, PBMCs were 
purified from human buffy coat by ficol gradient 
ce-ntrifugation as per manufacturers instructions (Sigma) . 
PBMCs were activated for 24 hours with lug/ml LPS. Using 
Miltenyi Biotech positive selection beads, CD4+, CD8+ and 
CD19+ cells were isolated from resting PBMCs. The CD8+ 
cells were activated for 24 hours using plate bound anti- 
CD3 . Resting monocytes were purified from PBMCs by 
gradient centrif ugation and stimulated with 0, 0.01, 0.1 
or lug/ml LPS either with or without 2ng/ml gIFN for 4 
hours or with 0, 0.2 or 2 ng/ml gIFN for 4 hours. 
Activated T cells (predominantly CD4+ cells)' were 
prepared from PBMCs by stimulation with plate bound anti- 
CD3 (TR66) for 3 days in RPMI 10% FBS . Cells were then 
expanded without anti-CD3 but in the presence of IL2 for 
22 days. Cells v;ere resuspended at 107/ml in fresh RPMI 
10% FBS with no exogenous cytokines, or ILIO (20ng/ml) 
plus IL4 (40ng/ml) or TNFa (5,000u/ml) plus gIFN 
(15ng/ml) . (All cytokines were purchased from Genzyme . ) 
Cells were harvested after 8 and 24 hours. RNA was 
prepared from all cell types using RNeasy mini kit 
(Qiagen) and expression was analyzed by standard Northern 
analysis using approx lOug total RNA. No expression was 
seen in resting leukocytes PBMCs, CD4-r, CD8-f, CD19+ or 
monocytes. A single band of approximately 3-3.5 kb was 
seen in activated CD8+ cells and in T cells activated 
with no exogenous cytokines or the combination of ILIO 
and IL4, but not with TNFa and gIFN (after both 3 and 24 
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These results also suggest that TANGO 195 may pla^ 
a role in B cell leukemia; immune response, and 
autoimmune disorders (e.g., arthritis). 

TANGO 195 maps to human chromosome locus hulq21. 
5 The flanking markers are AFMA323ZE5 and D1S2635. The 
among identified loci in close proximity to TANGO 195 are 
HYPLPl (hyperlipidemial) and LPDl (lipodystrophy) . 
Nearby known human genes include: SPTAl (spectrin, 
alpha) , THBS3 ( thrombospondin 3) , MTX (metaxin) , CTSS 

10 (cathepsin K,S), FLG (f ilaggrin) , PKLR (pyruvate kinase), 
HYPLIPl (hyperlipidemia) . 

The mouse chromosome corresponding to the human 
chromosomal locus is chromosome 3 . Nearby mouse loci 
include: soc (soft coat), hyplipl (hyperlipidemia), ft 

15 (flaky tail) and ma (matted) . Nearby mapped mouse genes 
include: Imna (lamin A), fig (f ilaggrin) , bean 
(brevican) , gba (acid beta glucosidase) . 

Rabbit polyclonal antibodies were raised against 
three peptides from, murine TANGO 195 peptides to amino 

20 acids 26-34, 102-117 and 161-176. Peptide purified sera 
from rabbits immunized with amino acids 102-117 
specifically recognizes mouse T195-hFc protein by 
scandarad Western Blot. Additionally unpurified sera 
from rabbits immunized with amino acids 102-117 recognize 

25 mouse T195-hFc by ELISA. ELISA plates were coated with 5 
Tg/ml mouse T195-hFc or human Ig control in PBS overnight 
at 4*C. Plates were washed and blocked with PBS 1% PSA. 
Serial dilutions of serum were added and incubated for 
approximately 2 hours at room temperature. Plates were 

3 0 washed and rabbit immunoglobulin detected with anti- 
rabbit Ig-HRP. Serum from rabbits immunized with the 
peptide corresponding to amino acids 102-117 showed 
greater than 20 fold higher tirres against mouse T195-hFc 
compared to human Ig, and showed greater than 20 fold 
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(CD45Rhi) , IgD+ B cells compared to control mice or mice 
expressing T195ex. The B220hi (CD45Rhi) , IgD+ B cells 
showed low levels of expression of Macl which is 
generally regarded as a marker for cells of the 
5 monocyte/macrophage lineage. FACs analysis of peripheral 
blood showed a slight decrease in CD4-i- T cells in T195fl 
expressing mice compared to control mice and T195ex 
expressing mice. FACs analysis of spleen and lymph node 
cells showed a similar increase in B220+ IgD+ Macllo 

10 cells. These cells had similar levels of surface IgD and 
IgM compared to B220+ cells from control and T195ex mice. 
CD45.1 staining confirmed that the B220-r IgD+ Macllo 
cells in spleen, lymph nodes and peripheral blood were 
derived from donor bone marrow. 

15 In T195fl expressing mice, FACs analysis of 

peritoneal lavage cells showed an increase in total B220- 
cells, an increase in B220+ CD23lo cells, and an increase 
in B220+ Macllo cells compared to control mice of T195ex 
expressing mice. B22 0+ cells showed slightly lower 

2 0 levels of IgD compared to B220+ cells from control mice 
or T195ex expressing mice. In addition, B220+ CD23lo 
cells in T195fl expressing mice showed significantly 
lower expression of B220 compared to control and T195ex 
mice. Finally, it was observed that very few CD5^ B220-- 

2 5 were seen on peritoneal lavage from T195fl, T195ecd or 

control mice . 

Taken together, these results suggest that 
increased expression of T195fl in bone marrow derived 
cells leads to an increase in Bl like cells in the 

3 0 periphery. Since Bl cells are not normally bone marrow 

derived the exact lineage of the cells is difficult to 
determine however they appear to be Bib cells. These 
results also suggest that increased expression of T195fl 
on bone marrow derived cells may lead to a decrease in 
35 CD4+ T cells m the periphery. 
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TANGO 195 expression or activity may be useful in the 
treatment of diorders associated with abberrant B cell 
expansion or differentiation or abberrant cytokine 
production, e.g., allergic and autoimmune disorders. 

5 TABLE 1: Summary of Human TANGO 191 and TANGO 195 



Sequence Information 



Gene 


cDNA 


ORF 


Protein 


Figure 


Accession 
No. 


TANGO IS'^ 


SEQ ID N0;1 


SEQ ID NO: 3 


SEQ ID NO: 2 


Fig. 1 




TANGO 195 
(partial) 


SEQ ID NO: 4 


SEQ ID NO: 6 


SEQ ID NO: 5 


Fig. 2 




TANGO 195 
{partial » 


SEQ ID NO: 


SEQ ID NO: 


SEQ ID NO: 


Fig. 8 




TANGO 195 
full-length 


SEQ ID NO: 


SEQ ID NO: 


SEQ ID NO: 


Fig. 9 





15 TABLE 2: Summary of Domains of TANGO 191 and TANGO 195 



ProLem 


Signal 
Sequence 


Mature 
Protein 


Extraceiiula 
Domain 


TransmemjDrane 
Domain 


Cytcciasmic 
Domain 

i 


TANGO 191 
SEQ ID NO: 2 


aa 1-19 
SEQ ID NO: 7 


aa 20-599 
SEQ ID NO: 8 


aa 20-357 
SEQ ID NO: 9 


aa 358-382 
SEQ ID NO: 10 


aa _-.b3-599 
SEQ ID NO: 11 


TANGO 195 
SEQ ID NO: 5 


aa 1-22 

SEQ ID 
NO: 12 


aa 23-312 
SEQ ID MC:X3 


aa 20-233 
SEQ ID NO: 14 


aa 234-254 
SEQ ID NO: 15 


aa 255-312 
SEQ ID NO: 16 



Various aspects of the invention are described in 
further detail in the following subsections 
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higher titres against mouse T195-hFc compared to control 
serum. 

Several TANGO 195 /immunoglobulin constant region 
fusion proteins were created. Using human TANGO 195 a 
5 fusion protein consisting of TANGO 195 (aal-233) - 
AAPGGASYKD- human IgGlfc was created. A second human 
TANGO 195 fusion substituted murine IgGlfc for human 
IgGlfc. Using murine TANGO 195 a fusion protein 
consisting of TANGO 195 (aal to 231) -AASGKASYKD- human 

10 IgGlfc was created. A second murine TANGO 195 fusion 
protein substituted murine IgGlfc for human IgGlfc. 

A clone (EpjthPb093fol) containing a 1.3 kb cDNA 
encoding apparently full-length TANGO 195 in pMET7 
between NotI and Sail was deposited with the American 

15 Type Culture Collection, Manassas, VA on September 9, 

1998, and assigned Accession Number . 

TANGO 195 has regions that are significantly 
similar to human signalling lymphocyte activation 
molecule ("SLAM") (Accession Number U33017) . For 

20 example, the region of TANGO 195 from amino acid 173 to 
amino acid 250 has 32% identity (25/73 amino aicds) and 
50% identity (39/59 amino acids) to the correspcnding 
region of SLAM; the region of TANGO 195 from amino acid 
134 to amino acid 164 has 32% identity (10/31 amino 

25 acids) and 41% identity (13/31 amino acids) to the 

corresponding region of SLAM; and the region of TANGO 195 
from amino acid 117 to amino acid 132 has 43% identity 
(7/16 amino acids) and 75% identity (12/16 amino acids) 
to the corresponding region of SLAM (Figure 5) . 

30 SLAM is thought to enhance the expansion and 

differentiation of activated B cells (Punnonen et al . 

(1997) J. Exp. Med. 185:993-1004) and m the regulation 
of cype 1 and type 2 cytokine production (Ferrante et al . 

(1998) J. Immunology 160:1514-21). TANGO 195 likely has 
35 a function similar to that of SLAM. Thus, modulators of 
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A nucleic acid molecule of the present invention, 
e.g.; a nucleic acid molecule having the nucleotide 
sequence of SEQ ID N0:1, 3, 4, or 6, the cDNA of ATCC 

s or the cDNA of ATCC , or a complement 

5 thereof, can be isolated using standard molecular biology 
techniques and the sequence information provided herein. 
Using all or a portion of the nucleic acid sequences of 

SEQ ID NO: 1, 3, 4, or 6 , the cDNA of ATCC , or the 

cDNA of ATCC as a hybridization probe, nucleic 

10 acid molecules of the invention can be isolated using 
standard hybridization and cloning techniques (e.g., as 
described in Sambrook et al . , eds., Molecular Cloning: A 
Laboratory Manual, 2nd ed.. Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 

15 Spring Harbor, NY, 1989) . 

A nucleic acid molecule of the invention can be 
amplified using cDNA, mRNA or genomic DNA as a template 
and appropriate oligonucleotide primers according to 
standard PCR amplification techniques. The nucleic acid 

20 so amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a 
nucleic acid molecule of the invention can be prepared by 
standard synthetic techniques, e.g., using an automated 

2 5 DNA synthesizer. 

In another preferred embodiment, an isolated nucleic 
acid molecule of the invention comprises a nucleic acid 
molecule which is a complement of the nucleotide sequence 

of SEQ ID N0:1, 3, 4, or 6 , the cDNA of ATCC , or 

30 the cDNA of ATCC , or a portion thereof. A nucleic 

acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently 
complementary to the given nucleotide sequence that it 
can hybridize to the given nucleotide sequence thereby 

3 5 forming a stable duplex. 
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I, Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode a polypeptide of the 
invention or a biologically active portion thereof, as 
5 well as nucleic acid molecules sufficient for use as 
hybridization probes to identify nucleic acid molecules 
encoding a polypeptide of the invention and fragments of 
such nucleic acid molecules suitable for use as PGR 
primers for the amplification or mutation of nucleic acid 

10 molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
analogs. The nucleic acid molecule can be single- 

15 stranded or double -stranded, but preferably is double- 
scranded DNA. 

An "isolated" nucleic acid molecule is one which is 
separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid 

20 molecule. Preferably, an "isolated" nucleic acid 
molecule is free of sequences (preferably protein 
encoding sequences) which naturally flank the nucleic 
acid (i.e., sequences located at the 5' and 3' ends of 
the nucleic acid) in the genomic DNA of the organism from 

2 5 which the nucleic acid is derived. For example, in 

various embodiments, the isolated nucleic acid molecule 
can contain less than about 5kB, 4kB, 3kB, 2kB, IkB, 
0.5 kB or 0.1 kB of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the 

30 cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors 

35 or other chemicals when chemically synthesized. 
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A nucleic acid fragment encoding a "biologically 
active portion" of a polypeptide of the invention can be 
prepared by isolating a portion of any of SEQ ID NO: 3 or 

6, the nucleotide sequence of the cDNA of ATCC , or 

5 the nucleotide sequence of the cDNA of ATCC which 

encodes a polypeptide having a biological activity, 
expressing the encoded portion of the polypeptide protein 
(e.g., by recombinant expression in vitro) and assessing 
the activity of the encoded portion of the polypeptide. 

10 The invention further encompasses nucleic acid 

molecules that differ from the nucleotide sequence of SEQ 

ID N0:1, 3, 4, or 6 , the cDNA of ATCC , or the cDNA 

of ATCC due to degeneracy of the genetic code and 

thus encode the same protein as that encoded by the 

15 nucleotide sequence of SEQ ID N0:1, 3, 4, or 6, the cDNA 

of ATCC , or the cDNA of ATCC , 

In addition to the nucleotide sequences shown in SEQ 

ID NO: 3 and 6 and present in the cDNA of ATCC and 

the cDNA of ATCC , it will be appreciated by those 

20 skilled in the art that DNA sequence polymorphisms that 
lead to changes in the amino acid sequence may exist 
within a population (e.g., the human population). Such 
genetic polymorphisms may exist among individuals within 
a population due to natural allelic variation. An allele 

25 is one of a group of genes which occur alternatively at a 
given genetic locus. As used herein, the phrase "allelic 
variant" refers to a nucleotide sequence which occurs at 
a given locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 

30 and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a polypeptide 
of the invention. Such natural allelic variations can 
typically result in 1-5% variance in the nucleotide 
sequence of a given gene. Alternative alleles can be 

3 5 identified by sequencing the gene of interest in a number 
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Moreover, a nucleic acid molecule of the invention 
can comprise only a portion of a nucleic acid sequence 
encoding a full length polypeptide of the invention for 
example, a fragment which can be used as a probe or 
5 primer or a fragment encoding a biologically active 

portion of a polypeptide of the invention. The nucleotide 
sequence determined from the cloning one gene allows for 
the generation of probes and primers designed for use in 
identifying and/or cloning homologues in other cell 

10 types, e.g., from other tissues, as well as homologues 
from other mammals. The probe/primer typically comprises 
substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 

15 conditions to at least about 12, preferably about 25, 
more preferably about 50, 75, 100, 125, 150, 175, 200, 
250, 300, 350 or 400 consecutive nucleotides of the sense 
or ant i- sense strand of SEQ ID N0:1, 3, 4, or 6, the cDNA 
ATCC , or the cDNA of ATCC or of a naturally 

20 occurring mutant of SEQ N0:1, 3, 4, or 6, the cDNA of 

ATCC , or the cDNA of ATCC . 

Probes based on the sequence of a nucleic acid 
molecule of the invention can be used to detect 
transcripts or genomic sequences encoding the same 

25 protein molecule encoded by a selected nucleic acid 
molecule. The probe comprises a label group attached 
thereto, e.g., a radioisotope, a fluorescent compound, an 
enzyme, or an enzyme co- factor. Such probes can be used 
as part of a diagnostic test kit for identifying cells or 

30 tissues which mis-express the protein, such as by 

measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., 
detecting mRNA levels or determining whether a gene 
encoding the protein has been mutated or deleted. 
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As used herein, the term "hybridizes under stringent 
conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
5 identical to each other typically remain hybridized to 
each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989) , 
6.3.1-6.3.6. A preferred, non-limiting example of 

10 stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 
50-65°C. Preferably, an isolated nucleic acid molecule 
of the invention that hybridizes under stringent 

15 conditions to the sequence of SEQ ID N0:1, 3, 4 , or 6, 

the cDNA of ATCC , or the cDNA of ATCC , or the 

complement thereof, corresponds to a naturally-occurring 
nucleic acid molecule. As used herein, a "naturally- 
occurring" nucleic acid molecule refers to an RNA or DNA 

20 molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants 
of a nucleic acid molecule of the invention sequence that 
may exist in the population, the skilled artisan will 

2 5 further appreciate that changes can be introduced by 

mutation thereby leading to changes in the amino acid 
sequence of the encoded protein, without altering the 
biological activity of the protein. For example, one can 
make nucleotide substitutions leading to amino acid 
30 substitutions at "non-essential" amino acid residues. A 
"non-essential" amino acid residue is a residue that can 
be altered from the wild- type sequence without altering 
the biological activity, whereas an "essential" amino 
acid residue is required for biological activity. For 

3 5 example, amino acid residues that are not conserved or 
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of different individuals. This can be readily carried 
out by using hybridization probes to identify the same 
genetic locus in a variety of individuals. Any and all 
such nucleotide Variations and resulting amino acid 
5 polymorphisms or variations that are the result of 
natural allelic variation and that do not alter the 
functional activity are intended to be within the scope 
of the invention. 

Moreover, nucleic acid molecules encoding proteins 

10 of the invention from other species (homologues) , which 
have a nucleotide sequence which differs from that of the 
protein described herein are intended to be within the 
scope of the invention. Nucleic acid molecules 
corresponding to natural allelic variants and homologues 

15 of a cDNA of the invention can be isolated based on their 
identity to the nucleic acid molecule disclosed herein 
using a cDNA, or a portion thereof, as a hybridization 
probe according to standard hybridization techniques 
under stringent hybridization conditions. For example, a 

2 0 cDNA encoding a soluble form of a membrane -bound protein 

of the invention isolated based on its hybridization to a 
nucleic acid molecule encoding all or part of the 
membrane -bound form. Likewise, a cDNA encoding a 
membrane -bound form can be isolated based on its 

25 hybridization to a nucleic acid molecule encoding all or 
pare of the soluble form. 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

30 800, 900, 1000, or 1290) nucleotides in length and 

hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence, preferably 
the coding sequence, of SEQ ID N0:1, 3, 4, or 6, the cDNA 
of ATCC , the cDNA of ATCC , or a complement 

3 5 thereof. 
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acids with basic side chains (e.g., lysine, arginine, 
histidine) , acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glutamine, serine, threonine, 
5 tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine) . 

10 Alternatively, mutations can be introduced randomly along 
all or part of the coding sequence, such as by 
saturation mutagenesis, and the resultant mutants can be 
screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded 

15 protein can be expressed recombinantly and the activity 
of the protein can be determined. 

In a preferred embodiment, a mutant polypeptide that 
is a variant of a polypeptide of the invention can be 
assayed for: (1) the ability to form protein : protein 

20 interactions with proteins in a signalling pathway of the 
polypeptide of the invention; (2) the ability to bind a 
ligand of the polypeptide of the invention; or (3) the 
ability to bind to an intracellular target protein of the 
polypeptide of the invention. In yet another preferred 

25 embodiment, the mutant polypeptide can be assayed for the 
ability to modulate cellular proliferation or cellular 
differentiation. 

The present invention encompasses antisense nucleic 
acid molecules, i.e., molecules which are complementary 

30 to a sense nucleic acid encoding a polypeptide of the 

invention, e.g., complementary to the coding strand of a 
double- stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 

3 5 nucleic acid can be complementary to an entire coding 
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only semi -conserved among homologues of various species 
may be non-essential for activity and thus would be 
likely targets for alteration. Alternatively, amino acid 
residues that are conserved among the homologues of 
5 various species (e.g., murine and human) may be essential 
for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention 
pertains to nucleic acid molecules encoding a polypeptide 

10 of the invention that contain changes in amino acid 
residues that are not essential for activity. Such 
polypeptides differ in amino acid sequence from SEQ ID 
NO: 2, 5, 8, and 13 yet retain biological activity. In 
one embodiment, the isolated nucleic acid molecule 

15 includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 
45% identical, 65%, 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of any of SEQ ID NO: 2, 5, 8, or 
13. 

20 An isolated nucleic acid molecule encoding a variant 
protein can be created by introducing one or more 
nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID N0:1, 3, 4, or 6, the cDNA 
of ATCC , or the cDNA of ATCC such that one 

2 5 or more amino acid substitutions, additions or deletions 

are introduced into the encoded protein. Mutations can 
be introduced by standard techniques, such as site- 
directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are 

3 0 made at one or more predicted non-essential amino acid 

residues. A "conservative amino acid substitution" is 
one m which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families 
of amino acid residues having similar side chains have 
35 been defined in the art. These families include amino 
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met hoxycarboxymethyl uracil . 5-methoxyuracil , 2 - 
methylthio-N6 - isopentenyladenine , uracil - 5 -oxyacet ic acid 
(v) , wybutoxosine, pseudouracil , queosine, 2- 
thiocytosine , 5 -methyl-2 -thiouracil , 2 - thiouracil , 4 - 
thiouracil, S-methyluracil , uracil -5 -oxyacet ic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)W; and 2 , 6-diaminopurine . Alternatively, the 
antisense nucleic acid can be produced biologically using 
an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of 
interest, described further in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a selected 
polypeptide of the invention to thereby inhibit 
expression, e.g., by inhibiting transcription and/or 
translation. The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, 
for example; in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An 
example of a route of administration of antisense nucleic 
acid molecules of the invention includes direct injection 
at: a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
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strand, or to only a portion thereof, e.g., all or part 
of the protein coding region (or open reading frame) . An 
antisense nucleic acid molecule can be antisense to all 
or part of a noncoding region of the coding strand of a 
5 nucleotide sequence encoding a polypeptide of the 
invention. The noncoding regions ("5' and 3' 
untranslated regions") are the 5' and 3' sequences which 
flank the coding region and are not translated into amino 
acids . 

10 An antisense oligonucleotide can be, for example, 

about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides 
in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 

15 For example, an antisense nucleic acid (e.g., an 

antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 

20 stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 5- 

25 f luorouracil , 5-bromouracil , 5-chlorouracil , 5- 

iodouracil, hypoxanthine, xanthine, 4- acetyl cytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl -2 - thiouridine , 5- 
carboxymethylaminomethyluracil , dihydrouracil , beta-D- 

30 galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine , 1 -methyl inosine , 2,2 -dimethyl guanine , 2 - 
methyladenine , 2 -methylguanine , 3 -methylcytosine , 5 - 
methylcytosine, N6- adenine, 7 -methylguanine, 5-- 
me t hy 1 ami nome thyluracil, 5 - met hoxy ami nome t hy 1 - 2 - 

35 thiouracil, beta-D-mannosylqueos.ine, 5'- 
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Alternacively, an niRNA encoding a polypeptide of the 
invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA 
molecules. See, e.g., Bartel and Szostak (1993) Science 
5 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, expression of a polypeptide of the invention can 
be inhibited by targeting nucleotide sequences 

10 complementary to the regulauory region of the gene 
encoding the polypeptide (e.g., the promoter and/or 
enhancer) to form triple helical structures that prevent 
transcription of the gene in target cells, See generally 
Helene (1991) Anticancer Drug Des. 6(6) : 569-84; Helene 

15 (1992) Ann. N.Y. Acad. Sex, 660:27-36; and Maher (1992) 
Bioassays 14 (12) :807-15. 

In preferred embodiments, the nucleic acid molecules 
of the invention can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the 

20 stability, hybridization, or solubility of the molecule. 
For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al . (1996) Bioorganic & Medicinal 
Chemistry 4(1) : 5-23) . As used herein, the terms 

25 "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 

30 allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996), supra; Perry-0' Keef e et al . (1996) Proc. Natl. 

35 Acad. Sci. USA 93: 14670-675. 
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antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
5 antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

An antisense nucleic acid molecule of the invention 
can be an a-anomeric nucleic acid molecule. An a- 

10 anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual jS-units, the strands run parallel 
to each other (Gaultier et al . (1987) Nucleic Acids Res. 
15:6625-6641). The antisense nucleic acid molecule can 

15 also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 215:327- 
330) . 

The invention also encompasses ribozymes. Ribozymes 

20 are catalytic RNA molecules with ribonuclease activity 
which are capable of cleaving a single -stranded nucleic 
acid, such as an mRNA, to which they have a complementary 
region- Thus, ribozymes (e.g., hammerhead ribozymes 
(described -in Haselhoff and Gerlach (1988) Nature 

25 334:585-591)) can be used to catalytically cleave mRNA 

transcripts to thereby inhibit translation of the protein 
encoded by the mRNA. A ribozyme having specificity for a 
nucleic acid molecule encoding a polypeptide of the 
invention can be designed based upon the nucleotide 

3 0 sequence of a cDNA disclosed herein. For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be 
constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence 
to be cleaved in a Cech et al. U.S. Patent No. 4,987,071; 

3 5 and Cech et al . U.S. Patent No. 5,116,742. 
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PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al . (1996) Nucleic Acids Res. 
24 (17) : 3357-63) . Alternatively, chimeric molecules can 
5 be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al . (1975) Bioorganic Med, Chem. Lett. 
5:1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g,, for 

10 targeting host cell receptors in vivo) , or agents 

facilitating transport across the cell membrane {see, 
e.g., Letsinger et al . (1989) Proc, Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCX Publication No. WO 88/09810) or 

15 the blood-brain barrier (see, e.g., PCT Publication No. 
WO 89/10134) . In addition, oligonucleotides can be 
modified with hybridization-triggered cleavage agents 
(see, e.g., Krol et al . (1988) Bio/Techniques 6:958-976) 
or intercalating agents {see, e.g., Zon (1988) Pharm. 

20 Res. 5:539-549) . To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 
agent, hybridization- triggered cleavage agent, etc. 

II. Isolated Proteins and Antibodies 

2 5 One aspect of the invention pertains to isolated 

proteins and polypeptides of the invention, and 
biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to 
raise antibodies directed against a polypeptide of the 

3 0 invention. In one embodiment, the native polypeptide can 

be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein 
purification techniques. In another embodiment, 
polypepcides of the invention are produced by recombinant 
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PNAs can be used in therapeutic and diagnostic 
applications. For example, PNAs can be used as antisense 
or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or 
5 translation arrest or inhibiting replication. PNAs can 
also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; 
as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup 

10 (1996), supra; or as probes or primers for DNA sequence 
and hybridization {Hyrup (1996), supra; Perry-0' Keef e et 
al. (1996) Proc. Natl. Acad. Sci , USA 93: 14670-675). 

In another embodiment, PNAs can be modified, e.g., 
to enhance their stability or cellular uptake, by 

15 attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of 
liposomes or other techniques of drug delivery known in 
the art. For example, PNA-DNA chimeras can be generated 
which may combine the advantageous properties of PNA and 

20 DNA. Such chimeras allow DNA recognition enzymes, e.g., 
RNAse H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked 
using linkers of appropriate lengths selected in terms of 

25 base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996) , supra) . The synthesis of 
PNA-DNA chimeras can be performed as described in Hyrup 
(1996), supra, and Finn et al . (1996) Nucleic Acids Res. 
24(17) : 3357-63. For example, a DNA chain can be 

30 synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified 
nucleoside analogs. Compounds such as 5' -(4-, 
methoxycrityl) amino-5' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 

35 DNA (Mag et al . (1989) Nucleic Acids Res. 17:5973-88). 
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fewer amino acids than the full length protein, and 
exhibit at least one activity of the corresponding full- 
length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 
5 the corresponding protein. A biologically active portion 
of a protein of the invention can be a polypeptide which 
is, for example, 10, 25, 50, 100 or more amino acids in 
length. Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 

10 prepared by recombinant techniques and evaluated for one 
or more of the functional activities of the native form 
of a polypeptide of the invention. 

Preferred polypeptides have the amino acid sequence 
of SEQ ID N0:2, 5, 7-11, and 12-16. Other useful 

15 proteins are substantially identical (e.g., at least 

about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99%) to 
SEQ ID NO: 2, 5, 7-11, and 12-16 and retain the functional 
activity of the protein of the corresponding naturally- 
occurring protein yet differ in amino acid sequence due 

20 to natural allelic variation or mutagenesis . 

To determine the percent identity of two amino acid 
sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 

25 nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 

3 0 residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 

35 of idencical positions/total # of positions (e.g., 
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DNA techniques. Alternative to recombinant expression, a 
polypeptide of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 
An "isolated" or "purified" protein or biologically 
5 active portion thereof is substantially free of cellular 
material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or 
substantially free of chemical precursors or other 
chemicals when chemically synthesized. The language 

10 "substantially free of cellular material" includes 

preparations of protein in which the protein is separated 
from cellular components of the cells from which it is 
isolated or recombinantly produced. Thus, protein that 
is substantially free of cellular material includes 

15 preparations of protein having less than about 30%, 20%, 
10%, or 5% (by dry weight) of heterologous protein (also 
referred to herein as a "contaminating protein") . When 
the protein or biologically active portion thereof is 
recombinantly produced, it is also preferably 

20 substantially free of culture medium, i.e., culture 

medium represents less than about 20%, 10%, or 5% of the 
volume of the protein preparation. When the protein is 
produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 

25 chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the procein. Accordingly such preparations of the 
protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or compounds other than 

3 0 the polypeptide of interest. 

Biologically active portions of a polypeptide of the 
invention include polypeptides comprising amino acid 
sequences sufficiently identical to or derived from the 
amino acid sequence of the protein (e.g., the amino acid 

3 5 sequence of SEQ ID NO: 2, 5, 8, or 13) , which include 
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table, a gap length penalty of 12, and a gap penalty of 4 
can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
5 above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides chimeric or fusion 
proteins. As used herein, a "chimeric protein" or 
"fusion protein" comprises all or part (preferably 

10 biologically active) of a polypeptide of the invention 
operably linked to a heterologous polypeptide (i.e., a 
polypeptide other than the same polypeptide of the 
invention) . Within the fusion protein, the term 
"operably linked" is intended to indicate that the 

15 polypeptide of the invention and the heterologous 
polypeptide are fused in- frame to each other. The 
heterologous polypeptide can be fused to the N- terminus 
or C- terminus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in 

20 which the polypeptide of the invention is fused to the C- 
terminus of GST sequences. Such fusion proteins can 
facilitate the purification of a recombinant polypeptide 
of the invention. 

In another embodiment, the fusion protein contains a 

25 heterologous signal sequence at its N- terminus. For 
example, the native signal sequence of a polypeptide of 
the invention can be removed and replaced with a signal 
sequence from another protein. For example, the gp67 
secretory sequence of the baculovirus envelope protein 

30 can be used as a heterologous signal sequence {Current 
Protocols in Molecular Biology, Ausubel et al . , eds . , 
John Wiley & Sons, 1992) . Other examples of eukaryotic 
heterologous signal sequences include the secretory 
sequences of melittin and human placental alkaline 

3 5 phosphatase (Stratagene; La Jolla, California) . In yet 
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overlapping positions) x 100) . Preferably; the two 
sequences are the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
5 algorithm. A preferred, non-limiting example of a 

mathematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karl in and Altschul (1990) 
Proc, Natl. Acad. Sci. USA 87:2264-2268, modified as in 
Karl in and Altschul (1993) Proc. Natl. Acad. Sci. USA 

10 90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) J. 
Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 

15 to a nucleic acid molecules of the invention. BLAST 
protein searches can be performed with the XBLAST 
program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to a protein molecules of the 
invention. To obtain gapped alignments for comparison 

20 purposes, Gapped BLAST can be utilized as described in 
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 
Alternatively, PSI-Blast can be used to perform an 
iterated search which detects distant relationships 
between molecules. Jd. When utilizing BLAST, Gapped 

25 BLAST, and PSI-Blast programs, the default parameters of 
the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another 
preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the 

30 algorithm of Myers and Miller, (1.988) CABIOS 4:11-17. 

Such an algorithm is incorporated into the ALIGN program 
(version 2.0) vyhich is part of the GCG sequence alignment 
software package. When utilizing the ALIGN program for 
comparing amino acid sequences, a PAM12 0 weight residue 
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sequence (see, e.g., Ausiibel et al . , supra). Moreover, 
many expression vectors are commercially available that 
already encode a fusion moiety (e.g., a GST polypeptide). 
A nucleic acid encoding a polypeptide of the invention 
5 can be cloned into such an expression vector such that 
the fusion moiety is linked in-frame to the polypeptide 
of the invention. 

A signal sequence of a polypeptide of the invention 
(SEQ ID NO: 7 or 12) can be used to facilitate secretion 

10 and isolation of a secreted protein or other protein of 
interest. Signal sequences are typically characterized 
by a core of hydrophobic amino acids which are generally 
cleaved from the mature protein during secretion in one 
or more cleavage events. Such signal peptides contain 

15 processing sites that allow cleavage of the signal 

sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to 
the described polypeptides having a signal sequence, as 
well as to the signal sequence itself and to the 

20 polypepcide in the absence of the signal sequence (i.e., 
the cleavage products) . In one embodiment, a nucleic 
acid sequence encoding a signal sequence of the invention 
can be operably linked in an expression vector to a 
protein of interest, such as a protein which is 

25 ordinarily not secreted or is otherwise difficult to 
isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the 
expression vector is transformed, and the signal sequence 
is subsequently or concurrently cleaved. The protein can 

30 then be readily purified from the extracellular medium by 
art recognized methods. Alternatively, the signal 
sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a 
GST domain. 
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another example; useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et 
. al . , supra) and the protein A secretory signal (Pharmacia 
Biotech; Piscataway, New Jersey) . 
5 In yet another embodiment, the fusion protein is an 

immunoglobulin fusion protein in which all or part of a 
polypeptide of the invention is fused to sequences 
derived from a member of the immunoglobulin protein 
family. The immunoglobulin fusion proteins of the 
10 invention can be incorporated into pharmaceutical 

compositions and administered to a subject to inhibit an 
interaction between a ligand (soluble or membrane -bound) 
and a protein on the surface of a cell (receptor) , to 
thereby suppress signal transduction in vivo. The 
15 immunoglobulin fusion protein can be used to affect the 
bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction 
may be useful therapeutically, both for treating 
proliferative and dif f erentiative disorders and for 
20 modulating (e.g. promoting or inhibiting) cell survival. 
Moreover, the immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce antibodies 
directed against a polypeptide of the invention in a 
subject, to purify ligands and in screening assays to 
25 identify molecules which inhibit the interaction of 
receptors with ligands. 

Chimeric and fusion protein of the invention can be 
produced by standard recombinant DNA techniques. In 
another embodiment, the fusion gene can be synthesized by 
3 0 conventional techniques including automated DNA 

synthesizers. Alternatively, PGR amplification of gene 
fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 
3 5 annealed and reamplified to generate a chimeric gene 
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invention for agonist or antagonist activity. In one 
embodiment, a variegated library of variants is generated 
by combinatorial mutagenesis at the nucleic acid level 
and is encoded by a variegated gene library. A 
5 variegated library of variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 
degenerate set of potential protein sequences is 
expressible as individual polypeptides, or alternatively, 

10 as a set of larger fusion proteins (e.g., for phage 
display) . There are a variety of methods which can be 
used to produce libraries of potential variants of the 
polypeptides of the invention from a degenerate 
oligonucleotide sequence. Methods for synthesizing 

15 degenerate oligonucleotides are known in the art (see, 
e.g., Narang (1983) Tetrahedron 39:3; Itakura et al. 
(1984) Annu. Rev, Biochem. 53:323; Itakura et al . (1984) 
Science 198:1056; Ike et al . (1983) Nucleic Acid Res, 
11:477) . 

20 In addition, libraries of fragments of the coding 

sequence of a polypeptide of the invention can be used to 

generate a variegated population of polypeptides for 

screening and subsequent selection of variants. For 

example, a library of coding sequence fragments can be 

o 

25 generated by treating a double stranded PGR fragment of 
the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing 
the DNA to form double stranded DNA which can include 

30 sense/antisense pairs from different nicked products, 

removing single stranded portions from reformed duplexes 
by treatment with SI nuclease, and ligating the resulting 
fragment: library into an expression vector. By this 
method, an expression library can be derived which 
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In another embodiment, the signal sequences of the 
present invention can be used to identify regulatory- 
sequences , e.g., promoters, enhancers, repressors. Since 
signal sequences are the most amino -terminal sequences of 
5 a peptide, it is expected that the nucleic acids which 
flank the signal sequence on its amino- terminal side will 
be regulatory sequences which affect transcription. 
Thus, a nucleotide sequence which encodes all or a 
portion of a signal sequence can be used as a probe to 

10 identify and isolate signal sequences and their flanking 
regions, and these flanking regions can be studied to 
identify regulatory elements therein, 

The present invention also pertains to variants of the 
polypeptides of the invention. Such variants have an 

15 altered amino acid sequence which can function as either 
agonists (mimetics) or as antagonists. Variants can be 
generated by mutagenesis, e.g., discrete point mutation 
or truncation. An agonist can retain substantially the 
same, or a subset, of the biological activities of the 

2 0 naturally occurring form of the protein. An antagonist 
of a protein can inhibit one or more of the activities of 
the naturally occurring form of the protein by, for 
example, competitively binding to a downstream or 
upstream member of a cellular signaling cascade which 

25 includes the protein of interest. Thus, specific 

biological effects can be elicited by treatment with a 
variant of limited function. Treatment of a subject with 
a variant having a subset of the biological activities of 
the naturally occurring form of the protein can have 

30 fewer side effects in a subject relative to treatment 
with the naturally occurring form of the protein. 

Variants of a protein of the invention which function 
as either agonists (mimetics) or as antagonists can be 
identified by screening combinatorial libraries of 

35 mutants, e.g., truncation mutants, of the protein of the 
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protein, e.g., hydrophilic regions. Figures 2 and 4 are 
hydrophobic ity plots of the proteins of the invention. 
These plots or similar analyses can be used to identify 
hydrophilic regions. 
5 An inimunogen typically is used to prepare antibodies by 
immunizing a suitable subject, (e.g., rabbit, goat, mouse 
or other mammal) . An appropriate immunogenic preparation 
can contain, for example, recombinant ly expressed 
chemically synthesized polypeptide. The preparation can 

10 further include an adjuvant, such as Freund's complete or 
incomplete adjuvant, or similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains 
to antibodies directed against a polypeptide of the 
invention. The term "antibody" as used herein refers to 

15 immunoglobulin molecules and immiinologically active 
portions of immunoglobulin molecules, i.e., molecules 
that contain an antigen binding site which specifically 
binds an antigen, such as a polypeptide of the invention. 
A molecule which specifically binds to a given 

20 polypeptide of the invention is a molecule which binds 
the polypeptide, but does not substantially bind other 
molecules in a sample, e.g., a biological sample, which 
naturally contains the polypeptide. Examples of 
immunologically active portions of immunoglobulin 

25 molecules include F(ab) and F{ab')2 fragments which can be 
generated by treating the antibody with an enzyme such as 
pepsin. The invention provides polyclonal and monoclonal 
antibodies. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers 

30 to a population of antibody molecules that contain only 
one species of an antigen binding site capable of 
immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described 
above by immunizing a suitable subject with a polypeptide 

35 of the invention as an immunogen. The antibody titer in 
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encodes N- terminal and internal fragments of various 
sizes of the protein of interest. 

Several techniques are known in the art for screening 
gene products of combinatorial libraries made by point 
5 mutations or truncation, and for screening cDNA libraries 
for gene products having a selected property. The most 
widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries 
typically include cloning the gene library into 
10 replicable expression vectors, transforming appropriate 
cells with the resulting library of vectors, and 
expressing the combinatorial genes under conditions in 
which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product 
15 was detected. Recursive ensemble mutagenesis (REM) , a 
technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with 
the screening assays to identify variants of a protein of 
the invention (Arkin and Yourvan (1992) Proc. Natl, Acad. 
20 Sci, C7SA 89:7811-7815; Delgrave et al . (1993) Protein 
Engineering 6 (3) : 327-331) . 

An isolated polypeptide of the invention, or a fragment 
thereof, can be used as an immunogen to generate 
antibodies using standard techniques for polyclonal and 
25 monoclonal antibody preparation. The full-length 

polypeptide or protein can be used or, alternatively, the 
invention provides antigenic peptide fragments for use as 
immunogens . The antigenic peptide of a protein of the 
invention comprises at least 8 (preferably 10, 15, 20, or 
30 30) amino acid residues of the amino acid sequence of SEQ 
ID NO: 8 or 13 and encompasses an epitope of the protein 
such that an antibody raised against the peptide forms a 
specific immune complex with the protein. 

Preferred epitopes encompassed by the antigenic peptide 
35 are regions that are located on the surface of the 
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particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
5 Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 
92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum, 
10 AntiJbod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J. 12:725- 
734. 

Additionally, ^recombinant antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both 

15 human and non-human portions, which can be made using 

standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques 
known in the art, for example using methods described in 

20 PCT Publication No. WO 87/02671; European Patent 

Application 184,187; European Patent Application 171,496; 
European Patent Application 173,4 94; PCT Publication No. 
WO 86/01533; U.S. Patent No. 4,816,567; European Patent 
Application 125,023; Better et al . (1988) Science 

25 240:1041-1043; Liu et al . (1987) Proc, Natl. Acad. Sci . 
USA 84:3439-3443; Liu et al , (1987) J. Immunol. 
139:3521-3526; Sun et al. (1987) Proc. Natl. Acad. Sci. 
USA 84:214-218; Nishimura et al . (1987) Cane. Res. 
47:999-1005; Wood et al . (1985) J^ature 314:446-449; and 

30 Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559); 
Morrison (1985) Science 229:1202-1207; Oi et al . (1986) 
Bio/Techniques 4:214; U.S. Patent 5,225,53 9; Jones et al . 
(1986) Nature 321:552-525; Verhoeyan et al . (1988) 
Science 239:1534; and Beidler et al . (1988) J. Immunol. 

35 141:4053-4060. 
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the immunized subject can be monitored over time by 
standard techniques, such as with an enzyme linked 
immunosorbent assay (ELISA) using immobilized 
polypeptide. If desired, the antibody molecules can be 
5 isolated from the mammal (e.g., from the blood) and 
further purified by well-known techniques, such as 
protein A chromatography to obtain the IgG fraction. At 
an appropriate time after immunization, e.g., when the 
specific antibody titers are highest, antibody-producing 
0 cells can be obtained from the s\abject and used to 
prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique originally described by 
Kohler and Milstein (1975) Nature 256:495-497, the human 
B cell hybridoma technique (Kozbor et al . (1983) Immunol. 
5 Today 4:72), the EBV-hybridoma technique (Cole et al . 

(1985) , Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96) or trioma techniques. The 
technology for producing hybridomas is well known (see 
generally Current Protocols in Immunology (1994) Coligan 
0 et al. (eds.) John Wiley & Sons, Inc., New York, NY) . 
Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of 
interest, e.g., using a standard ELISA assay. 
5 Alternative to preparing monoclonal antibody-secreting 
hybridomas, a monoclonal antibody directed against a 
polypeptide of the invention can be identified and 
isolated by screening a recombinant combinatorial 
immunoglobulin library (e.g., an antibody phage display 
0 library) with the polypeptide of interest. Kits for 
generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP'"* Phage Display Kit, Catalog No. 
5 240612) . Additionally, examples of methods and reagents 
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An antibody directed against a polypeptide of the 
invention (e.g., monoclonal antibody) can be used to 
isolate the polypeptide by standard techniques, such as 
affinity chromatography or immunoprecipitation. 
5 Moreover, such an antibody can be used to detect the 
protein (e.g., in a cellular lysate or cell supernatant) 
in order to evaluate the abundance and pattern of 
expression of the polypeptide. The antibodies can also 
be used diagnostically to monitor protein levels in 

10 tissue as part of a clinical testing procedure, e.g., to, 
for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic 

15 groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, iS-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic 

20 group complexes include streptavidin/biotin and 

avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 

25 of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include '^^I, "^I, ^^S or ^H. 

III. Recombinant Expression Vectors and Host Cells 
30 Another aspect of the invention pertains to vectors, 
preferably expression vectors, containing a nucleic acid 
encoding a polypeptide of the invention (or a portion 
thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 



wo 00/18800 



PCT/US99/22818 



- 47 - 

Completely human antibodies are particularly desirable 
for therapeutic treatment of human patients. Such 
antibodies can be produced using transgenic mice which 
are incapable of expressing endogenous immunoglobulin 
5 heavy and light chains genes, but which can express human 
heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, 
e.g., all or a portion of a polypeptide of the invention. 
Monoclonal antibodies directed against the antigen can be 

10 obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the 
transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 
mutation. Thus, using such a technique, it is possible 

15 to produce therapeutically useful IgG, IgA and IgE 
antibodies. For an overview of this technology for 
producing human antibodies, see Lonberg and Huszar (1995, 
Int. Rev, Immunol, 13:65-93). For a detailed discussion 
of this technology for producing human antibodies and 

20 human monoclonal antibodies and protocols for producing 
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. In addition, 
companies such as Abgenix, Inc. (Freemont, CA) , can be 

25 engaged to provide human antibodies directed against a 
selected antigen using technology similar to that 
described above. 

Completely human antibodies which recognize a selected 
epitope can be generated using a technique referred to as 

30 "guided selection." In this approach a selected 

non-human monoclonal antibody, e.g., a murine antibody, 
is used to guide the selection of a completely human 
antibody recognizing the same epitope (Jespers et al, 
(1994) Bio/Technology 12:899-903). 
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the vector is introduced into the host cell) . The term 
"regulatory sequence" is intended to include promoters, 
enhancers and other expression control elements (e.g., 
polyadenylation signals) . Such regulatory sequences are 
5 described, for example, in Goeddel, Gene Expression 
Technology: Methods in EnzymoSogy 185, Academic Press, 
San Diego, CA (1990) , Regulatory sequences include those 
which direct constitutive expression of a nucleotide 
sequence in many types of host cell and those which 

10 direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory 
sequences) . It will be appreciated by those skilled in 
the art that the design of the expression vector can 
depend on such factors as the choice of the host cell to 

15 be transformed, the level of expression of protein 

desired, etc. The expression vectors of the invention 
can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or 
peptides, encoded by nucleic acids as described herein. 

20 The recombinant expression vectors of the invention can 
be designed for expression of a polypeptide of the 
invention in prokaryotic or eukaryotic cells, e.g., 
bacterial cells such as E. coii, insect cells (using 
baculovirus expression vectors) , yeast cells or mammalian 

25 cells. Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant 
expression vector can be transcribed and translated in 
vitro, for example using T7 promoter regulatory sequences 
and T7 polymerase. 

3 0 Expression of proteins in prokaryotes is most often 
carried out m E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 

35 encoded therein, usually to the amino terminus of the 
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nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double 
stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, 
5 wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 

10 vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 

15 expression of genes to which they are operably linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
such other forms of expression vectors, such as viral 

20 vectors (e.g., replication defective retroviruses, 

adenoviruses and adeno-associated viruses) , which serve 
equivalent functions. 

The recombinant expression vectors of the invention 
comprise a nucleic acid of the invention in a form 

2 5 suitable for expression of the nucleic acid in a host 

cell. This means that the recombinant expression vectors 
include one or more regulatory sequences, selected on the 
basis of the host cells to be used for expression, which 
is operably linked to the nucleic acid sequence to be 

3 0 expressed. Within a recombinant expression vector, 

"operably linked" is intended to mean that the nucleotide 
sequence of interest is linked to the regulatory 
sequence (s) in a manner which allows for expression of 
the nucleotide sequence (e.g., in an in vitro 
3 5 transcription/translation system or in a host cell when 
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recombinant protein (Gottesman, Gene Expression 
Technology: Methods in Enzymology 185; Academic Press, 
San Diego, California (1990) 119-128) . Another strategy 
is to alter the nucleic acid sequence of the nucleic acid 
5 to be inserted into an expression vector so that the 
individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al . (1992) 
Nucleic Acids Res, 20:2111-2118). Such alteration of 
nucleic acid sequences of the invention can be carried 

10 out by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast 
expression vector. Examples of vectors for expression in 
yeast S, cerivisae include pYepSecl (Baldari et al . 
(1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 

15 (1982) Cell 30:933-943), pJRY88 (Schultz et al . (1987) 
Gene 54:113-123), pYES2 (Invitrogen Corporation, San 
Diego, CA) , and pPicZ (Invitrogen Corp, San Diego, CA) , 
Alternatively, the expression vector is a baculovirus 
expression vector. Baculovirus vectors available for 

20 expression of proteins in cultured insect cells (e.g., Sf 
9 cells) include the pAc series (Smith et al . (1983) Mol, 
Cell Biol. 3:2156-2165) and the pVL series (Lucklow and 
Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the 

25 invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDMS {Seed (1987) Nature 
329:840) and pMT2PC (Kaufman et al . (1987) EMBO J. 6:187- 
195) . When used in mammalian cells, the expression 

30 vector's control functions are often provided by viral 
regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenoviruis 2, 
cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 

35 cells see chapters 16 and 17 of Sambrook et al . , supra. 
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recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
5 recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
0 fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
5 pMAL (New England Biolabs, Beverly, MA) and pRITS 

(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 
Examples of suitable inducible non-fusion E, coli 
0 expression vectors include pTrc (Amann et al . , (1988) 
Gene 69:301-315) and pET lid (Studier et al . , Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
5 host RNA polymerase transcription from a hybrid trp-lac 
fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
0 strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression 
in E. coli is to express the protein in a host bacteria 
5 with an impaired capacity to proteolyt ically cleave the 
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cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen which 
direct constitutive, tissue specific or cell type 
specific expression of antisense RNA. The antisense 
5 expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense 
nucleic acids are produced under the control of a high 
efficiency regulatory region, the activity of which can 
be determined by the cell type into which the vector is 

10 introduced. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub et al . 
(-Reviews - Trends in Genetics, Vol. 1(1) 1986). 

Another aspect of the invention pertains to host cells 
into which a recombinant expression vector of the 

15 invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 

20 may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 
herein . 

25 A host cell can be any prokaryotic (e.g., E. coli) or 
eukaryotic cell (e.g., an insect cell, yeast, or a 
mammalian cell) . 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 

30 transfection techniques. As used herein, the terms 

"nransf ormation" and "transfection" are intended to refer 
to a variety of arc -recognized techniques for introducing 
foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE- 

35 dextran-mediated cransf ection, lipofection, or 
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In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
5 express the nucleic acid) . Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 
promoter (liver-specific; Pinkert et al . (1987) Genes 
Dev. 1:268-277), lymphoid-specif ic promoters (Calame and 

10 Eaton (1988) Adv. Iminunol . 43:235-275), in particular 
promoters of T cell receptors (Winoto and Baltimore 
(1989) EMBO u. 8:729-733) and immunoglobulins {Banerji et 
al . (1983) Ceil 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron- specif ic promoters (e.g., the 

15 neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl, Acad. Sci , USA 86:5473-5477), pancreas -specific 
promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 

20 Application Publication No. 264,166). Development ally - 
regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
249:374-379) and the of- fetoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

2 5 The invention further provides a recombinant expression 
vector comprising a DNA molecule of the invention cloned 
into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a 
regulatory sequence in a manner which allows for 

30 expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to the mRNA encoding a 
polypeptide of the invention. Regulatory sequences 
operably linked zo a nucleic acid cloned in the antisense 
orientation can be chosen which direct the continuous 
35 expression of the antisense RNA molecule in a variety of 
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non-human transgenic animals in which exogenous sequences 
encoding a polypeptide of the invention have been 
introduced into their genome or homologous recombinant 
animals in which endogenous encoding a polypeptide of the 
5 invention sequences have been altered. Such animals are 
useful for studying the function and/or activity of the 
polypeptide and for identifying and/or evaluating 
modulators of polypeptide activity. As used herein, a 
"transgenic animal" is a -non-human animal, preferably a 

10 mammal, more preferably a rodent such as a rat or mouse, 
in which one or more of the cells of the animal includes 
a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, 
chickens, amphibians, etc. A transgene is exogenous DNA 

15 which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the 
genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal. As used 

20 herein, an "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in 
which an endogenous gene has been altered by homologous 
recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the 

25 animal, e.g., an embryonic cell of the animal, prior to 
development of the animal. 

A transgenic animal of the invention can be created by 
introducing nucleic acid encoding a polypeptide of the 
invention (or a homologue thereof) into the male 

30 pronuclei of a fertilized oocyte, e.g., by 

microinjection, retroviral infection, and allowing the 
oocyte CO develop in a pseudopregnant female foster 
animal. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the 

35 efficiency of expression of the transgene. A tissue- 
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electroporation . Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al . 
(supra) r and other laboratory manuals. 

For stable transfection of mammalian cells, it is known 
that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 
markers include those which confer resistance to drugs, 
such as G418, hygrcmycin and methotrexate. Cells stably 
transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
while the other cells die) . 

A host cell of the invention, such as a prokaryotic or 
eukaryotic host cell in culture, can be used to produce a 
polypeptide of the invention. Accordingly, the invention 
further provides methods for producing a polypeptide of 
the invention using the host cells of the invention. In 
one embodiment, the method comprises culturing the host 
cell of invention (into which a recombinant expression 
vector encoding a polypeptide of the invention has been 
introduced) in a suitable medium such that the 
polypeptide is produced. In another embodiment, the 
method further comprises isolating the polypeptide from 
the medium or the host cell. 

The host cells of the invention can also be used to 
produce nonhuman transgenic animals. For example, in one 
embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences 
encoding a polypeptide of the invention have been 
introduced. Such host cells can then be used to create 
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5' and 3' ends by additional nucleic acid of the gene to 
allow for horaologous recombination to occur between the 
exogenous gene carried by the vector and an endogenous 
gene in an embryonic stem cell. The additional flanking 
5 nucleic acid sequences are of sufficient length for 
successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both 
at the 5' and 3' ends) are included in the vector (see, 
e.gr./ Thomas and Capecchi (1987) Cell 51:503 for a 

10 description of homologous recombination vectors) . The 
vector is introduced into an embryonic stem cell line 
(e.g., by electroporation) and cells in which the 
introduced gene has homologously recombined with the 
endogenous gene are selected {see, e.g., Li et al . (1992) 

15 Cell 69:915). The selected cells are then injected into 
a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras (see, e.g., Bradley in 
Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113- 

2 0 152) . A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 

2 5 contain the homologously recombined DNA by germline 

transmission of the transgene . Methods for constructing 
homologous recombination vectors and homologous 
recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
30 PCT Publication NOS. WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can 
be produced which contain selected systems which allow 
for regulated expression of the transgene. One example 

3 5 of such a syscem is the cre/loxP recombinase system of 
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specific regulatory sequence (s) can be operably linked to 
the transgene to direct expression of the polypeptide of 
the invention to particular cells. Methods for 
generating transgeriic animals via embryo manipulation and 
5 microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for 
example, in U.S. Patent NOS . 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the 
Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold 

10 Spring Harbor, N.Y., 1986) . Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the transgene in its genome and/or expression of mRNA 
encoding the transgene in tissues or cells of the 

15 animals. A transgenic founder animal can then be used to 
breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying the transgene can 
further be bred to other transgenic animals carrying 
other transgenes. 

20 To create an homologous recombinant animal, a vector is 
prepared which contains at least a portion of a gene 
encoding a polypeptide of the invention into which a 
deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the gene. In 

2 5 a preferred embodiment, the vector is designed such that, 

upon homologous recombination, the endogenous gene is 
functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" 
vector) . Alternatively, the vector can be designed such 

3 0 that, upon homologous recombination, the endogenous gene 

is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region 
can be altered to thereby alter the expression of the 
endogenous protein) . In the homologous recombination 
35 vector., the altered portion of the gene is flanked at its 
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Supplementary active compounds can also be incorporated 
into the compositions. 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
5 administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, 
subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. 
Solutions or suspensions used for parenteral, 

10 intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 

15 methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or 

2 0 dextrose. pH can be adjusted with acids or bases, such 

as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use 
25 include sterile aqueous solutions (where water soluble) 
or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or 
dispersions. For intravenous administration, suitable 
carriers include physiological saline, bacteriostatic 

3 0 water, Cremophor EL'" (BASF; Parsippany, NJ) or phosphate 

buffered saline (PBS) . In all cases, the composition 
must be sterile and should be fluid to the extent that 
easy syringabiiity exists. It must be stable under the 
conditions of manufacture and storage and must be 
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bacteriophage PI. For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al . (1992) Proc. 
Natl. Acad. Sci . USA 89:6232-6236. Another example of a 
recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al . (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used 
to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase 
and a selected protein are required. Such animals can be 
provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described 
herein can also be produced according to the methods 
described in Wilmut et al . (1997) Nature 385:810-813 and 
PCT Publication NOS . WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and 
0 antibodies (also referred to herein as "active 

compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. 
Such compositions typically comprise the nucleic acid 
molecule, protein, or antibody and a pharmaceutically 
5 acceptable carrier. As used herein the language 

"pharmaceutically acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with 
0 pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well 
known in the art. Except insofar as any conventional 
media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. 
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Oral compositions generally include an inert diluent or 
an edible carrier. They can be enclosed in gelatin 
capsules or compressed into tablets. For the purpose of 
oral therapeutic administration, the active compound can 
5 be incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can 
also be prepared using a fluid carrier for use as a 
mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

10 Pharmaceutical ly compatible binding agents, and/or 
adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 

15 microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening 

2 0 agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring . 

For administration by inhalation, the compounds are 
delivered in the form of an aerosol spray from a 

2 5 pressurized container or dispenser which contains a 

suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

Systemic administration can also be by transmucosal or 
transdermal means. For tftansmucosal or transdermal 

3 0 administration, penetrants appropriate to the barrier to 

be permeated are used in the formulation. Such 
penetrants are generally known in the art, and include, 
for example, fcr rransmucosal administration, detergents, 
bile salts, and fusidic acid derivatives. Transmucosal 
3 5 administration can be accomplished through the use of 
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preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for 
example, water, e-ihanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by 
the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 
chlorobutanol , phenol, ascorbic acid, thimerosal, and the 
like. In many cases, it will be preferable to include 
isotonic agents, for example, sugarso polyalcohols such 
as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the 
composicion an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a polypeptide or 
antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients 
enumeraced above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle 
which contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of 
preparation are vacuum drying and f reeze-drying which 
yields a powder of the active ingredient plus any 
additional desired ingredient from a previously sterile- 
filtered solution thereof. 
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the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
5 active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 
100 mg/kg of body weight (generally 10 mg/kg to 20 
mg/kg) . If the antibody is to act in the brain, a dosage 
of 50 mg/kg to 100 mg/kg is usually appropriate. 

10 Generally, partially human antibodies and fully human 
antibodies have a longer half-life within the human body 
than other antibodies. Accordingly, lower dosages and 
less frequent administration is often possible. 
Modifications such as lipidation can be used to stabilize 

15 antibodies and to enhance uptake and tissue penetration 
(e.g., into the brain). A method for lipidation of 
antibodies is described by Cruikshank et al . ((1997) j". 
Acquired Immune Deficiency Syndromes and Human 
Retrovirology 14:193). 

2 0 The nucleic acid molecules of the invention can be 

inserted into vectors and used as gene therapy vectors. 
Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
25 (see, e.g., Chen et al . (1994) Proc. Natl. Acad, Sci , USA 
91:3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 

3 0 Alternatively, where the complete gene delivery vector 

can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can 
include one or more cells which produce the gene delivery 
system. 
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nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in 
the art . 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared 
with carriers that will protect the compound against 
rapid elimination from the body, such as a controlled 
release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
those skilled m the art. The materials can also be 
obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
pharmaceutically acceptable carriers. These can be 
prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4 , 522 , 811 . 

It is especially advantageous to formulate oral or 
parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
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A. Screening Assays 

The invention provides a method (also referred to 
herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 
5 (e.g., peptides, peptidomimetics, small molecules or 
other drugs) which bind to polypeptide of the invention 
or have a stimulatory or inhibitory effect on, for 
example, expression or activity of a polypeptide of the 
invention. 

10 In one embodiment, the invention provides assays for 
screening candidate or test compounds which bind to or 
modulate the activity of the membrane -bound form of a 
polypeptide of the invention or biologically active 
portion thereof. The test compounds of the present 

15 invention can be obtained using any of the numerous 

approaches in combinatorial library methods known in the 
art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase 
libraries; synthetic library methods requiring 

2 0 deconvolution; the "one -bead one -compound" library 
method; and synthetic library methods using affinity 
chromatography selection. The biological library 
approach is limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide 

25 oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des . 12:145) . 

Examples of methods for the synthesis of molecular 
libraries can be found in the art, for example in: 
DeWitt et al . (1993) Proc. Natl. Acad. Sci , USA 90:6909; 

30 Erb et al . (1994) Proc, Natl. Acad. Sci, USA 91:11422; 
Zuckermann et al . (1994). J. Med. Chem. 21:2618; Cho et 
al. (1993) Science 261:1303; Carrell et al . (1994) Angew. 
Chem. Int. Ed. Engl. 33:2059; Carell et al . (1994) Angew. 
Chem. Int. Ed. Engl. 33:2061; and Gallop et al . (1994) J, 

35 Med. Chem. 37:1233. 
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The pharmaceutical compositions can be included in a 
container, pack, or dispenser together with instructions 
for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 
assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) ; c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). For 
example, polypeptides of the invention can to used to (i) 
modulate cellular proliferation; (ii) modulate cellular 
differentiation; and (iii) modulate cell survival. The 
isolated nucleic acid molecules of the invention can be 
used to express proteins (e.g., via a recombinant 
expression vector in a host cell in gene therapy 
applications), to detect mRNA (e.g., in a biological 
sample) or a genetic lesion, and to modulate activity of 
a polypeptide of the invention. In addition, the 
polypeptides of the invention can be used to screen drugs 
or compounds which modulate activity or expression of a 
polypeptide of the invention as well as to treat 
disorders characterized by insufficient or excessive 
production of a protein of the invention or production 
of a form of a protein of the invention which has 
decreased or aberrant activity compared to the wild type 
protein. In addition, the antibodies of the invention 
can be used to detect and isolate a protein of the and 
modulate activity of a protein of the invention. 

This invention further pertains to novel agents 
identified by tne above-described screening assays and 
uses thereof for treatments as described herein. 
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invention, or a biologically active portion thereof, on 
the cell surface with a known compound which binds the 
polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the 
5 ability of the test compound to interact with the 

polypeptide, wherein determining the ability of the test 
compound to interact with the polypeptide comprises 
determining the ability of the test compound to 
preferentially bind to the polypeptide or a biologically 
10 active portion thereof as compared to the Icnown compound. 

In another embodiment, an assay is a cell -based assay 
comprising contacting a cell expressing a membrane -bound 
form of a polypeptide of the invention, or a biologically 
active portion thereof, on the cell surface with a test 

15 compound and determining the ability of the test compound 
to modulate (e.g., stimulate or inhibit) the activity of 
the polypeptide or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of the polypeptide or a biologically active 

20 portion thereof can be accomplished, for example, by 
determining the ability of the polypeptide protein to 
bind to or interact with a target molecule. 

Determining the ability of a polypeptide of the 
invention to bind to or interact with a target molecule 

25 can be accomplished by one of the methods described above 
for determining direct binding. As used herein, a 
"target molecule" is a molecule with which a selected 
polypeptide (e.g., a polypeptide of the invention binds 
or interacts with in nature, for example, a molecule on 

30 the surface of a cell which expresses the selected 

protein, a molecule on the surface of a second cell, a 
molecule in the extracellular milieu, a molecule 
associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A target molecule can be a 
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Libraries of compounds may be presented in solution 
(e.g., Houghten (1992) Bio/Techniques 13:412-421), or on 
beads (Lam (1991) Nature 354 :82-84) , chips (Fodor (1993) 
Nature 364:555-556), bacteria (U.S. Patent No. 

5 5,223,409), spores (Patent NOS, 5,571,698; 5,403,484; and 
5,223,409), plasmids (Cull et al . (1992) Proc. Nati. 
Acad. Sci. USA 89:1865-1869) or phage (Scott and Smith 
(1990) Science 249'!386-390 ; Devlin (1990) Science 
249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci, 

0 USA 87:6378-6382; and Felici (1991) J. Mol . Biol. 
222 :301-310) . 

In one embodiment, an assay is a cell-based assay in 
which a cell which expresses a membrane -bound form of a 
polypeptide of the invention, or a biologically active 

5 portion thereof, on the cell surface is contacted with a 
test compound and the ability of the test compoiind to 
bind to the polypeptide determined. The cell, for 
example, can be a yeast cell or a cell of mammalian 
origin. Determining the ability of the test compound to 

0 bind to the polypeptide can be accomplished, for example, 
by coupling the test compound with a radioisotope or 
enzymatic label such that binding of the test compound to 
the polypeptide or biologically active portion thereof 
can be determined by detecting the labeled compound in a 

5 complex. For example, test compounds can be labeled with 
^"I, ^^S, ^^C, or ^H, either directly or indirectly, and 
the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. 
Alternatively, test compounds can be enzymatically 

0 labeled with, for example, horseradish peroxidase, 

alkaline phosphatase, or lucif erase, and the enzymatic 
label detected by determination of conversion of an 
appropriate substrate to product. In a preferred 
embodiment, the assay comprises contacting a cell which 

5 expresses a membrane -bound form of a polypeptide of the 
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polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the 
ability cf the test compound to interact with the 
polypeptide, wherein determining the ability of the test 
5 compound to interact with the polypeptide comprises 
determining the ability of the test compound to 
preferentially bind to the polypeptide or biologically 
active portion thereof as compared to the known compound. 
In another embodiment, an assay is a cell -free assay 

10 comprising contacting a polypeptide of the invention or 
biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the 
polypeptide or biologically active portion thereof. 

15 Determining the ability of the test compound to modulate 
the activity of the polypeptide can be accomplished, for 
example, by determining the ability of the polypeptide to 
bind to a target molecule by one of the methods described 
above for determining direct binding. In an alternative 

20 embodiment, determining the ability of the test compound 
to modulate the activity of the polypeptide can be 
accomplished by determining the ability of the 
polypeptide of the invention to further modulate the 
target molecule. For example, the catalytic/enzymatic 

25 activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
comprises contacting a polypeptide of the invention or 
biologically active portion thereof with a known compound 

30 which binds the polypeptide to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with the polypeptide, wherein determining the ability of 
the test compound to interact with the polypeptide 

35 comprises determining the ability of the polypeptide to 
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polypeptide of the invention or some other polypeptide or 
protein. For example, a target molecule can be a 
component of a signal transduction pathway which 
facilitates transduction of an extracellular signal 
5 (e.g., a signal generated by binding of a compound to a 
polypeptide of the invention) through the cell membrane 
and into the cell or a second intercellular protein which 
has catalytic activity or a protein which facilitates the 
association of downstream signaling molecules with a 
10 polypeptide of the invention. Determining the ability of 
a polypeptide of the invention to bind to or interact 
with a target molecule can be accomplished by determining 
the activity of the target molecule. For example, the 
activity of the target molecule can be determined by 
15 detecting induction of a cellular second messenger of the 
target (e.g., intracellular Ca^*, diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a regulatory element 
20 that is responsive to a polypeptide of the invention 

operably linked to a nucleic acid encoding a detectable 
marker, e.g. luciferase) , or detecting a cellular 
response, for example, cellular differentiation, or cell 
proliferation. 

25 In yet another embodiment, an assay of the present 
invention is a cell-free assay comprising contacting a 
polypeptide of the invention or biologically active 
portion thereof with a test compound and determining the 
ability of the test compound to bind to the polypeptide 

30 or biologically active portion thereof. Binding of the 
test compound to the polypeptide can be determined either 
directly or indirectly as described above. In a 
preferred embodiment, the assay includes contacting the 
polypeptide of the invention or biologically active 

3 5 portion thereof with a known compound which binds the 
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Chemical; St. Louis, MO) or glutathione derivatized 
microtitre plates, which are then combined with the test 
compound or the test compound and either the non- adsorbed 
target protein or A polypeptide of the invention, and the 
5 mixture incubated under conditions conducive to complex 
formation (e.g., at physiological conditions for salt and 
pH) . Following incubation, the beads or microtitre plate 
wells are washed to remove any unbound components and 
complex formation is measured either directly or 

10 indirectly, for example, as described above. 

Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the 
polypeptide of the invention can be determined using 
standard techniques . 

15 Other techniques for immobilizing proteins on matrices 
can also be used in the screening assays of the 
invention. For example, either the polypeptide of the 
invention or its target molecule can be immobilized 
utilizing conjugation of biotin and streptavidin. 

20 Biotinyiated polypeptide of the invention or target 
molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques well known in the art 
(e.g., biotinylation kit, Pierce Chemicals; Rockford, 
IL) , and immobilized in the wells of streptavidin-coated 

25 96 well plates (Pierce Chemical) . Alternatively, 

antibodies reactive with the polypeptide of the ^invention 
or target molecules but which do not interfere with 
binding of the polypeptide of the invention to its target 
molecule can be derivatized to the wells of the plate, 

3 0 and unbound target or polypeptidede of the invention 
trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those 
described above for the GST- immobilized complexes, 
include immunodetection of complexes using antibodies 

35 reactive with the polypeptide of the invention or target 
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preferentially bind to or modulate the activity of a 
target molecule. 

The cell-free assays of the present invention are 
amenable to use of both a soluble form or the membrane- 

5 bound form of a polypeptide of the invention. In the 
case of cell -free assays comprising the membrane -bound 
form of the polypeptide, it may be desirable to utilize a 
solubilizing agent such that the membrane -bound form of 
the polypeptide is maintained in solution. Examples of 

0 such solubilizing agents include non- ionic detergents 
such as n-octylglucoside, n-dodecylglucoside, n- 
dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 
methylglucamide, Triton X-100, Triton X-114, Thesit, 
Isotridecypoly (ethylene glycol ether) n, 3-[(3- 

5 cholamidopropyl) dimethylamminio] -1-propane sulfonate 
(CHAPS) , 3- [ (3 -cholamidopropyl) dimethylamminio] -2- 
hydroxy- 1-propane sulfonate (CHAPSO) , or N-dodecyl=N, N- 
dimethyl- 3 -ammonio- 1-propane sulfonate . 

In more than one embodiment of the above assay methods 

0 of the present invention, it may be desirable to 

immobilize either the polypeptide of the invention or its 
target molecule to facilitate separation of complexed 
from uncomplexed forms of one or both of the proteins, as 
well as to accommodate automation of the assay. Binding 

5 of a test compound to the polypeptide, or interaction of 
the polypeptide with a target molecule in the presence 
and absence of a candidate compound, can be accomplished 
in any vessel suitable for containing the reactants. 
Examples of such vessels include microtitre plates, test 

0 tubes, and micro-centrifuge tubes. In one embodiment, a 
fusion protein can be provided which adds a domain that 
allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S-transf erase fusion 
proteins or glutathione-S-transf erase fusion proteins can 
5 be adsorbed onto glutathione sepharose beads (Sigma 
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No. WO 94/10300), to identify other proteins, which bind 
to or interact with the polypeptide of the invention and 
modulate activity of the polypeptide of the invention. 
Such binding proteins are also likely to be involved in 
5 the propagation of signals by the polypeptide of the 
inventions as, for example, upstream or downstream 
elements of a signaling pathway involving the polypeptide 
of the invention. 

This invention further pertains to novel agents 
10 identified by the above -described screening assays and 
uses thereof for treatments as described herein. 

B . Detection Assavs 

Portions or fragments of the cDNA sequences identified 
herein (and the corresponding complete gene sequences) 

15 can be used in numerous ways as polynucleotide reagents. 
For example, these sequences can be used to: (i) map 
their respective genes on a chromosome and, thus, locate 
gene regions associated with genetic disease; (ii) 
identify an individual from a minute biological sample 

20 (tissue typing) ; and (iii) aid in forensic identification 
of a biological sample. These applications are described 
in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a 
25 gene has been isolated, this sequence can be used to map 
the location of the gene on a chromosome. Accordingly, 
nucleic acid molecules described herein or fragments 
thereof, can be used to map the location of the 
corresponding genes on a chromosome. The mapping of the 
30 sequences to chromosomes is an important first step in 
correlating these sequences with genes associated with 
disease. 
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molecule, as well as enzyme -linked assays which rely on 
detecting an enzymatic activity associated with the 
polypeptide of the invention or target molecule. 

In another embodiment; modulators of expression of a 
polypeptide of the invention are identified in a method 
in which a cell is contacted with a candidate compound 
and the expression of the selected mRNA or protein (i.e., 
the mRNA or protein corresponding to a polypeptide or 
nucleic acid of the invention) in the cell is determined. 
The level of expression of the selected mRNA or protein 
in the presence of the candidate compound is compared to 
the level of expression of the selected mRNA of protein 
in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of 
expression of the polypeptide of the invention based on 
this comparison. For example, when expression of the 
selected mRNA or protein is greater (statistically 
significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is 
identified as a stimulator of the selected mRNA or 
protein expression. Alternatively, when expression of 
the selected mRNA or protein is less (statistically 
significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is 
identified as an inhibitor of the selected mRNA or 
protein expression. The level of the selected mRNA or 
protein expression in the cells can be determined by 
methods described herein. 

In yet another aspect of the invention, a polypeptide 
of the inventions can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 72:223- 
232; Madura et al . (1993) J. Biol, Chem. 268:12046-12054; 
Bartel et al . (1993) Bio/Technigues 14:920-924; Iwabuchi 
ec al. (1993) Oncogene 8:1693-1696; and PCT Publication 
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marking multiple sites and/or multiple chromosomes. 
Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
5 families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic 

10 map data. (Such data are found, for example, in V. 

McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 

15 linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al. (1987) Nature 
325 :783-787. 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 

20 associated with a gene of the invention can be 

determined. If a mutation is observed in some or all of 
the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the 
causative agent of the particular disease. Comparison of 

25 affected and unaffected individuals generally involves 
first looking for structniral alterations in the 
chromosomes such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR 
based on that DNA sequence. Ultimately, complete 

3 0 sequencing of genes from several individuals can be 
performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 
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Briefly, genes can be mapped to chromosomes by- 
preparing PGR primers (preferably 15-25 bp in length) 
from the sequence of a gene of the invention. Computer 
analysis of the sequence of a gene of the invention can 
5 be used to rapidly select primers that do not span more 
than one exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing 
individual human chromosomes. Only those hybrids 
10 containing the human gene corresponding to the gene 

sequences will yield an amplified fragment. For a review 
of this technique, see D'Eustachio et al . ({1983) Science 
220:919-924) . 

PCR mapping of somatic cell hybrids is a rapid 
15 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the nucleic acid sequences of the invention to design 
oligonucleotide primers, sublocalization can be achieved 
20 with panels of fragments from specific chromosomes. 

Other mapping strategies which can similarly be used to 
map a gene to its chromosome include in situ 
hybridization (described in Fan et al. (1990) Proc. Natl. 
Acad. Sci. USA 87:6223-27), pre-screening with labeled 

2 5 flow- sorted chromosomes (CITE) / and pre- selection by 

hybridization to chromosome specific cDNA libraries 
(CITE) . Fluorescence in situ hybridization (FISH) of a 
DNA sequence to a metaphase chromosomal spread can 
further be used to provide a precise chromosomal location 

3 0 in one step. For a review of this technique, see Verma 

ec al., (Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 
35 on that chromosome, or panels of reagents can be used for 
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individual humans occurs with a frequency of about once 
per each 500 bases. Each of the sequences described 
heasein can, to some degree, be used as a standard against 
which DNA from an individual can be compared for 
5 identification purposes. Because greater numbers of 
polymorphisms occur in the noncoding regions, fewer 
sequences are necessary to differentiate individuals. 
The noncoding sequences of SEQ ID N0:1 can comfortably 
provide positive individual identification with a panel 

10 of perhaps 10 to 1,000 primers which each yield a 

noncoding amplified sequence of 100 bases. If predicted 
coding sequences, such as those in SEQ ID NO: 3 are used, 
a more appropriate number of primers for positive 
individual identification would be 500-2,000. 

15 If a panel of reagents from the nucleic acid sequences 
described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 
individual. Using the unique identification database, 

20 positive identification of the individual, living or 
dead, can be made from extremely small tissue samples, 

3. Use of Partial Gene Sequences in Forensic Biology 
DNA-based identification techniques can also be used in 
forensic biology. Forensic biology is a scientific field 

25 employing genetic typing of biological evidence found at 
a crime scene as a means for positively identifying, for 
example, a perpetrator of a crime. To make such an 
identification, PGR ^.echnology can be used to amplify DNA 
sequences taken from very small biological samples such 

30 as tissues, e.g., hair or skin, or body fluids, e.g., 
blood, saliva, or semen found at a crime scene. The 
amplified sequence can then be compared to a standard, 
thereby allowing identification of the origin of the 
biological sample. 
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2. Tissue Typing 

The nucleic acid sequences of the present invention can 
also be used to identify individuals from minute 
biological samples. The United States military, for 
example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 
personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification 
difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP {described in 
U.S. Patent 5, 272, 057) . 

Furthermore, the sequences of the present invention can 
be used to provide an alternative technique which 
determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 
nucleic acid sequences described herein can be used to 
prepare two PGR primers from the 5' and 3' ends of the 
sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, 
prepared in this manner, can provide unique individual 
identifications, as each individual will have a unique 
set of such DNA sequences due to allelic differences. 
The sequences of the present invention can be used to 
obtain such identification sequences from individuals and 
from tissue. The nucleic acid sequences of the invention 
uniquely represent portions of the human genome. Allelic 
variation occurs to some degree in the coding regions of 
these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between 
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to diagnostic assays for determining expression of a 
polypeptide or nucleic acid of the invention and/or 
activity of a polypeptide of the invention, in the 
context of a biological sample (e.g., blood, serum, 
5 cells, tissue) to thereby determine whether an individual 
is afflicted with a disease or disorder, or is at risk of 
developing a disorder, associated with aberrant 
expression or activity of a polypeptide of the invention. 
The invention also provides for prognostic (or 

10 predictive) assays for determining whether an individual 
is at risk of developing a disorder associated with 
aberrant expression or activity of a polypeptide of the 
invention. For example, mutations in a gene of the 
invention can be assayed in a biological sample. Such 

15 assays can be used for prognostic or predictive purpose 
to thereby prophylactically treat an individual prior to 
the onset of a disorder characterized by or associated 
with aberrant expression or activity of a polypeptide of 
the invention. 

20 Another aspect of the invention provides methods for 
expression of a nucleic acid or polypeptide of the 
invention or activity of a polypeptide of the invention 
in an individual co thereby select appropriate 
therapeutic or prophylactic agents for that individual 

25 (referred to herein as "pharmacogenomics" ) . 

Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g., the genotype of tho individual examined to 

30 determine the ability of the individual to respond to a 
particular agent) . 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of a polypeptide 

35 of the invention in clinical trials. 
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The sequences of the present invention can be used to 
provide polynucleotide reagents, e.g., PGR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic 
5 identifications by, for example, providing another 

"identification marker'* (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
identification as an accurate alteznative to patterns 

10 formed by restriction enzyme generated fragments. 

Sequences targeted to noncoding regions are particularly 
appropriate for this use as greater numbers of 
polymorphisms occur in the noncoding regions, making it 
easier to differentiate individuals using this technique. 

15 Examples of polynucleotide reagents include the nucleic 
acid sequences of the invention or portions thereof, 
e.g., fragments derived from noncoding regions having a 
length of at least 20 or 30 bases. 

The nucleic acid sequences described herein can further 

20 be used to provide polynucleotide reagents, e.g., labeled 
or labelable probes which can be used in, for example, an 
in situ hybridization technique, to identify a specific 
tissue, e.g., brain tissue. This can be very useful in 
cases where a forensic pathologist is presented with a 

25 tissue of unknown origin. Panels of such probes can be 
used to identify tissue by species and/or by organ type. 

C. Predictive Medicine 

The present invention also pertains to the field of 
predictive medicine in which diagnostic assays, 
3 0 prognostic assays, pharmacogenomics, and monitoring 
clinical trails are used for prognostic (predictive) 
purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 



wo 00/18800 



PCT/US99/22818 



- 82 - 

indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples 
of indirect labeling include detection of a primary 
antibody using a f luorescently labeled secondary antibody 
5 and end-labeling of a DNA probe with biotin such that it 
can be detected with f luorescently labeled streptavidin. 
The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present 

10 within a subject. That is, the detection method of the 
invention can be used to detect mRNA, protein, or genomic 
DNA in a biological sample in vitro as well as in vivo. 
For example, in vitro techniques for detection of mRNA 
include Northern hybridizations and in situ 

15 hybridizations. In vitro techniques for detection of A 
polypeptide of the invention include enzyme linked 
immunosorbent assays (ELISAs) , Western blots, 
immunoprecipi tat ions and immunofluorescence. In vitro 
techniques for detection of genomic DNA include Southern 

20 hybridizations. Furthermore, in vivo techniques for 
detection of a polypeptide of the invention include 
introducing into a subject a labeled antibody directed 
against the polypeptide. For example, the antibody can 
be labeled with a radioactive marker whose presence and 

25 location in a subject can be detected by standard imaging 
techniques . 

In one embodiment, the biological sample contains 
protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
3 0 test subject or genomic DNA molecules from the test 

subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 
from a subject. 

In another embodiment, the methods further involve 
35 obtaining a control biological sample from a control 
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These and other agents are described in further detail 
in the following sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
absence of a polypeptide or nucleic acid of the invention 
in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting a 
polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of 
the invention such that the presence of a polypeptide or 
nucleic acid of the invention is detected in the 
biological sample. A preferred agent for detecting tnRNA 
or genomic DNA encoding a polypeptide of the invention is 
a labeled nucleic acid probe capable of hybridizing to 
mRNA or genomic DNA encoding a polypeptide of the 
invention. The nucleic acid probe can be, for example, a 
full-length cDNA, such as the nucleic acid of SEQ ID N0:1 
or 4, or a portion thereof, such as an oligonucleotide of 
at least 15, 30, 50, 100, 250 or 500 nucleotides in 
0 length and sufficient to specifically hybridize under 
stringent conditions to a mRNA or genomic DNA encoding a 
polypeptide of the invention. Other suitable probes for 
use in the diagnostic assays of the invention are 
described herein. 
5 A preferred agent for detecting A polypeptide of the 
invention is an antibody capable of binding to A 
polypeptide of the invention, preferably an antibody with 
a detectable label. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a 
0 fragment thereof (e.g., Fab or F(ab')2) can be used. The 
term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or 
antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as 
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For oligonucleotide-based kits, the kit may comprise, 
for example: (l) an oligonucleotide, e.g., a detectably 
labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide of the invention or 
5 (2) a pair of primers useful for amplifying a nucleic 
acid molecule encoding a polypeptide of the invention. 

The kit may also comprise, e.g., a buffering agent, a 
preservative, or a protein stabilizing agent. The kit 
may also comprise components necessary for detecting the 

10 detectable agent (e.g., an enzyme or a substrate). The 
kit may also contain a control sample or a series of 
control samples which can be assayed and compared to the 
test sample contained. Each component of the kit is 
usually enclosed within an individual container and all 

15 of the various containers are within a single package 
along with instructions for observing whether the tested 
subject is suffering from or is at risk of developing a 
disorder associated with aberrant expression of the 
polypeptide. 

20 2 . Procrnostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
disorder associated with aberrant expression or activity 

25 of a polypeptide of the invention. For example, the 

assays described herein, such as the preceding diagnostic 
assays or the following assays, can be utilized to 
identify a subject having or at risk of developing a 
disorder associated with aberrant expression or activity 

30 of a polypeptide of the invention. Alternatively, the 
prognostic assays can be utilized to identify a subject 
having or at risk for developing such a disease or 
disorder. Thus, the present invention provides a method 
in which a test sample is obtained from a subject and a 
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subject, contacting the control sample with a compound or 
agent capable of detecting a polypeptide of the invention 
or mRNA or genomic DNA encoding a polypeptide of the 
invention, such that the presence of the polypeptide or 
5 mRNA or genomic DNA encoding the polypeptide is detected 
in the biological sample, and comparing the presence of 
the polypeptide or mRNA or genomic DNA encoding the 
polypeptide in the control sample with the presence of 
the polypeptide or mRNA or genomic DNA encoding the 
10 polypeptide in the test sample. 

The invention also encompasses kits for detecting the 
presence of a polypeptide or nucleic acid of the 
invention in a biological sample (a test sample) . Such 
kits can be used to determine if a subject is suffering 
15 from or is at increased risk of developing a disorder 
associated with aberrant expression of a polypeptide of 
the invention (e.g., an immunological disorder). For 
example, the kit can comprise a labeled compound or agent 
capable of detecting the polypeptide or mRNA encoding the 
20 polypeptide in a biological sample and means for 

determining the amount of the polypeptide or mRNA in the 
sample (e.g., an antibody which binds the polypeptide or 
an oligonucleotide probe which binds to DNA or mRNA 
encoding the polypeptide) . Kits may also include 
25 instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 
associated with aberrant expression of the polypeptide if 
the amount of the polypeptide or mRNA encoding the 
polypeptide is above or below a normal level. 
30 For antibody-based kits, the kit may comprise, for 

example: (1) a first antibody (e.g., attached to a solid 
support) which binds to a polypeptide of the invention; 
and, optionally, (2) a second, different antibody which 
binds to either the polypeptide or the first antibody and 
3 5 is conjugated to a detectable agent. 



wo 00/18800 



PCT/US99/22818 



- 86 - 

In preferred embodiments, the methods include detecting, 
in a sample of cells from the subject, the presence or 
absence of a genetic lesion or mutation characterized by 
at least one of an alteration affecting the integrity of 
5 a gene encoding the polypeptide of the invention, or the 
mis-expression of the gene encoding the polypeptide of 
the invention. For example, such genetic lesions or 
mutations can be detected by ascertaining the existence 
of at least one of: Da deletion of one or more 

10 nucleotides from the gene; 2) an addition of one or more 
nucleotides to the gene; 3) a substitution of one or more 
nucleotides of the gene; 4) a chromosomal rearrangement 
of the gene; 5) an alteration in the level of a messenger 
RNA transcript of the gene; 6) an aberrant modification 

15 of the gene, such as of the methylation pattern of the 
genomic DNA; 7) the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of the gene; 8) a 
non-wild type level of a the protein encoded by the gene; 
9) an allelic loss of the gene; and 10) an inappropriate 

20 post-translational modification of the protein encoded by 
the gene. As described herein, there are a large number 
of assay techniques known in the art which can be used 
for detecting lesions in a gene. 

In certain embodiments, detection of the lesion 

2 5 involves the use of a probe/primer in a polymerase chain 
reaction (PGR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PGR or RACE PGR, or, 
alternatively, in a ligation chain reaction (LGR) (see, 
e.g., Landegran et al , (1988) Science 241:1077-1080; and 

30 Nakazawa et al . (1994) Proc . Natl, Acad, Sci. USA 91:360- 
364), the latter of which can be particularly useful for 
detecting point mutations in a gene (see, e.g., Abravaya 
et al . (1995) Nucleic Acids Res. 23:675-682). This 
method can include the steps of collecting a sample of 

35 cells from a patient, isolating nucleic acid (e.g.. 
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polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of 
the invention is detected, wherein the presence of the 
polypeptide or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
5 associated with aberrant expression or activity of the 
polypeptide. As used herein, a "test sample" refers to a 
biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 
10 Furthermore, the prognostic assays described herein can 
be used to determine whether a subject can be 
administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 
15 disorder associated with aberrant expression or activity 
of a polypeptide of the invention. For example, such 
methods can be used to determine whether a subject can be 
effectively treated with a specific agent or class of 
agents (e.g., agents of a type which decrease activity of 
20 the polypeptide) . Thus, the present invention provides 
methods for determining whether a subject can be 
effectively treated with an agent for a disorder 
associated with aberrant expression or activity of a 
polypeptide of the invention in which a test sample is 
25 obtained and the polypeptide or nucleic acid encoding the 
polypeptide is detected (e.g., wherein the presence of 
the polypeptide or nucleic acid is diagnostic for a 
subject that can be administered the agent to treat a 
disorder associated with aberrant expression or activity 
3 0 of the polypeptide) . 

The methods of the invention can also be used to detect 
genetic lesions or mutations in a gene of the invention, 
thereby determining if a subject with the lesioned gene 
is at risk for a disorder characterized aberrant 
35 expression or activity of a polypeptide of the invention. 
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specific mutations by development or loss of a ribozyme 
cleavage site. 

In other embodiments, genetic mutations can be 
identified by hybridizing a sample and control nucleic 
5 acids, e.g., DNA or RNA, to high density arrays 

containing hundreds or thousands of oligonucleotides 
probes (Cronin et al . (1996) Human Mutation 7:244-255; 
Kozal et al . (1996) Nature Medicine 2:753-759). For 
example, genetic mutations can be identified in two- 

10 dimensional arrays containing light -generated DNA probes 
as described in Cronin et al . , supra. Briefly, a first 
hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify 
base changes between the sequences by making linear 

15 arrays of sequential overlapping probes. This step 

allows the identification of point nrtutations. This step 
is followed by a second hybridization array that allows 
the characterization of specific mutations by using 
smaller, specialized probe arrays complementary to all 

20 variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to the 
wild- type gene and the other complementary to the mutant 
gene . 

In yet another embodiment, any of a variety of 
25 sequencing reactions known in the art can be used to 

directly sequence the selected gene and detect mutations 
by comparing the sequence of the sample nucleic acids 
with the corresponding wild- type (control) sequence. 
Examples of sequencing reactions include those based on 
30 techniques developed by Maxim and Gilbert ((1977) Proc. 
Natl. Acad. Sci, USA 74:560) or Sanger ((1977) Proc. 
Natl. Acad. Sci. USA 74:5463). It is also contemplated 
that any of a variety of automated sequencing procedures 
can be utilized when performing the diagnostic assays 
35 ((1995) Sio/Technigues 19:448) , including sequencing by 
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genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more 
primers which specifically hybridize to the selected gene 
under conditions such that hybridization and 
amplification of the gene {if present) occurs, and 
detecting the presence or absence of an amplification 
product, or detecting the size of the amplification 
product and comparing the length to a control sample. It 
is anticipated that PGR and/or LCR may be desirable to 
use as a preliminary amplification step in conjunction 
with any of the techniques used for detecting mutations 
described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method/ followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These detection 
schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in 
very low numbers. 

In an alternative embodiment, mutations in a selected 
gene from a sample cell can be identified by alterations 
in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonucleases, and fragment length sizes are determined 
by gel electrophoresis and compared. Differences in 
fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,4 98,531) can be used to score for the presence of 
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E. coli cleaves A at G/A mismatches and the thymidine DNA 
glycosylase from HeLa cells cleaves T at G/T mismatches 
(Hsu et al. (1994) Carcinogenesis 15:1657-1662). 
According to an exemplary embodiment, a probe based on a 
5 selected sequence, e.g., a wild- type sequence, is 
hybridized to a cDNA or other DNA product from a test 
cell(s) . The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. 

10 See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic 
mobility will be used to identify mutations in genes. 
For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in 

15 electrophoretic mobility between mutant and wild type 

nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:2766; see also Cotton (1993) Mutat. Res. 285:125- 
144; Hayashi (1992) Genet. Anal. Tech. Appl . 9:73-79). 
Single- stranded DNA fragments of sample and control 

20 nucleic acids will be denatured and allowed to renature. 
The secondary stanacture of single- stranded nucleic acids 
varies according to sequence, and the resulting 
alteration in electrophoretic mobility enables the 
detection of even a single base change. The DNA 

25 fragmencs may be labeled or detected with labeled probes. 
The sensitivity of the assay may be enhanced by using RNA 
(rather than DNA) , in which the secondary structure is 
more sensitive to a change in sequence. In a preferred 
embodiment, the subject method utilizes heteroduplex 

3 0 analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic 
mobility (Keen et al . (1991) Trends Genet. 7:5). 

In yet another embodiment, the movement of mutant or 
wild- type fragments in polyacrylamide gels containing a 

3 5 gradient of denaturant is assayed using denaturing 
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mass spectrometry {see, e.g., PCT Publication No. WO 
94/16101; Cohen et al . (1996) Adv, Chromatogr. 36:127- 
162; and Griffin et al . (1993) Appl. Biochem. Biotechnol , 
38:147-159) . 

5 Other methods for detecting mutations in a selected 
gene include methods in which protection from cleavage 
agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al . (1985) Science 
230:1242). In general, the technique of ^'mismatch 

10 cleavage" entails providing heteroduplexes formed by 

hybridizing (labeled) RNA or DNA containing the wild- type 
sequence with potentially mutant RNA or DNA obtained from 
a tissue sample. The double- stranded duplexes are 
treated with an agent which cleaves single- stranded 

15 regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample 
strands, RNA/DNA duplexes can be treated with RNase to 
digest mismatched regions, and DNA/DNA hybrids can be 
treated with SI nuclease to digest mismatched regions. 

20 In other embodiments, either DNA/DNA or RNA/DNA duplexes 
can be treated with hydroxylamine or osmium tetroxide and 
with piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing 

25 polyacrylamide gels to determine the site of mutation. 
See, e.g.. Cotton et al. (1988) Proc. Natl. Acad. Sci. 
USA 85:4397; Saleeba et al . (1992) Methods Enzymol . 
217:286-295. In a preferred embodiment, the control DNA 
or RNA can be labeled for detection. 

30 In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 
mismatched base pairs in double- stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in cDNAs obtained 
3 5 from samples of cells. For example, the mutY enzyme of 
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gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a ^GC clamp of 
5 approximately 4 0 bp of high-melting GC-rich DNA by PGR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys . Chem. 265:12753). 
10 Examples of other techniques for detecting point 
mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 
15 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc. Natl. Acad. 
Sci. USA 86:6230). Such allele specific oligonucleotides 
20 are hybridized to PGR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification technology 
25 which depends on selective PGR amplification may be used 
in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 
30 differential hybridization) (Gibbs et al . (1989) Nucleic 
Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238). In addition, it may be desirable to 
35 introduce a novel restriction site in the region of the 
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mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 
5 (1991) Proc. Natl. Acad. Sci , USA 88:189). In such 
cases, ligation will occur only if there is a perfect 
match az the 3' end of the 5' sequence making it possible 
to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

10 amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
. reagent described herein, which may be conveniently used, 

15 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a gene encoding a polypeptide of the 
invention. 

Furthermore, any cell type or tissue, preferably 
20 peripheral blood leukocytes, in which the polypeptide of 
the invention is expressed may be utilized in the 
prognostic assays described herein. 

3 . Pharmacoaenomics 

25 Agents, or modulators which have a stimulatory or 
inhibitory effect on activity or expression of a 
polypeptide of the invention as identified by a screening 
assay described herein can be administered to individuals 
to treat (prophylactically or therapeutically) disorders 

3 0 associated with aberrant activity of the polypeptide. In 
conjunccion with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual's genotype and that individual's response to a 
foreign compound or drug) of the individual may be 

35 considered. Differences in metabolism of therapeutics 
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can lead to severe toxicity or therapeutic failure by 
altering the relation oetween dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 
5 selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens. 

10 Accordingly, the activity of a polypeptide of the 
invention, expression of a nucleic acid of the 
invention, or mutation content of a gene of the invention 
in an individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 

15 treatment of the individual. 

Pharmacogenomics deals with clinically significant 
hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 43(2):254- 

20 266. In general, two types of pharmacogenetic conditions 
can be differentiated. Genetic conditions transmitted as 
a single factor altering the way drugs act on the body 
are referred to as "altered drug action." Genetic 
conditions transmitted as single factors altering the way 

2 5 the body acts on drugs are referred to as "altered drug 

metabolism" . These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, 
glucose-6 -phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical 
30 complication is haemolysis after ingestion of oxidant 
drugs (anti-malarials , sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug 
metabolizing enzymes is a major determinant of both the 

3 5 intensity and duration of drug action. The discovery of 



wo 00/18800 



PCT/US99/228i8 



- 94 - 

genetic polymorphisms of drug metabolizing enzymes (e.g. , 
N-acetyl transferase 2 (NAT 2) and cytochrome P450 enzymes 
CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or 
5 show exaggerated drug response and serious toxicity after 
taking the standard and safe dose of a drug. These 
polymorphisms are expressed in two phenotypes in the 
population, the extensive metabolizer (EM) and poor 
metabolizer (PM) . The prevalence of PM is different 

10 among different populacions . For example, the gene 
coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 

15 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, a PM will show no therapeutic 
response, as demonstrated for the analgesic effect of 
codeine mediated by its CYP2D6- formed metabolite 

20 morphine. The other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. 
Recently, the molecular basis of ultra-rapid metabolism 
has been identified to be due to CYP2D6 gene 
amplification. 

2 5 Thus, the activity of a polypeptide of the invention, 
expression of a nucleic acid encoding the polypeptide, .or 
mutation content of a gene encoding the polypeptide in an 
individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 

30 treatment of the individual. In addition, 

pharmacogenetic studies can be used to apply genotyping 
of polymorphic alleles encoding drug-metabolizing enzymes 
to the identification of an individual's drug 
responsiveness phenccype. This knowledge, when applied 

35 to dosing or drug selection, can avoid adverse reactions 
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or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a 
modulator of activity or expression of the polypeptide, 
such as a modulator identified by one of the exemplary 
5 screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 
compounds) on the expression or activity of a polypeptide 
of the invention (e.g., the ability to modulate aberrant 

10 cell proliferation and/or differentiation) can be applied 
not only in basic drug screening, but also in clinical 
trials. For example, the effectiveness of an agent, as 
determined by a screening assay as described herein, to 
increase gene expression, protein levels or protein 

15 activity, can be monitored in clinical trials of subjects 
exhibiting decreased gene expression, protein levels, or 
protein activity. Alternatively, the effectiveness of an 
agent, as determined by a screening assay, to decrease 
gene expression, protein levels or protein activity, can 

20 be monitored in clinical trials of subjects exhibiting 
increased gene expression, protein levels, or protein 
activity. In such clinical trials, expression or 
activity of a polypeptide of the invention and 
preferably, that of other polypeptide that have been 

25 implicated in for example, a cellular proliferation 
disorder, can be used as a marker of the immune 
responsiveness of a particular cell. 

For example, and not by way of limitation, genes, 
including those of the invention, that are modulated in 

30 cells by treatment with an agent (e.g., compound, drug or 
small molecule) which modulates activity or expression of 
a polypeptide of the invention (e.g., as identified in a 
screening assay descrioed herein) can be identified. 
Thus, to study the effect of agents on cellular 
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proliferation disorders, for example, in a clinical 
trial, cells can be isolated and RNA prepared and 
analyzed for the levels of expression of a gene of the 
invention and other genes implicated in the disorder. 
5 The levels of gene expression (i.e., a gene expression 
pattern) can be quantified by Northern blot analysis or 
RT-PCR, as described herein, or alternatively by 
measuring the amount of protein produced, by one of the 
methods as described herein, or by measuring the levels 
0 of activity of a gene of the invention or other genes. 
In this way, the gene expression pattern can serve as a 
marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, 
5 treatment of the individual with the agent. 

In a preferred embodiment, the present invention 
provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 
0 acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of the polypeptide or nucleic acid of 
5 the invention in the preadministration sample; (iii) 
obtaining one or more post -administration samples from 
the subject; (iv) detecting the level the of the 
polypeptide or nucleic acid of the invention in the post- 
administration samples; (v) comparing the level of the 
0 polypeptide or nucleic acid of the invention in the pre- 
administration sample with the level of the polypeptide 
or nucleic acid of the invention in the post- 
administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. 
5 For example, increased administration of the agent may be 
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desirable to increase the expression or activity of the 
polypeptide to higher levels than detected, i.e., to 
increase the effectiveness of the agent. Alternatively, 
decreased administration of the agent may be desirable to 
5 decrease expression or activity of the polypeptide to 
lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

C. Methods of Treatment 

The present invention pro^fides for both prophylactic 
10 and therapeutic methods of treating a subject at risk of 
(or susceptible to) a disorder or having a disorder 
associated with aberrant expression or activity of a 
polypeptide of the invention. 

1 . Prophylactic Methods 

15 In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 
associated with an aberrant expression or activity of a 
polypeptide of the invention, by administering to the 
subject an agent which modulates expression or at least 

20 one activity of the polypeptide. Subjects at risk for a 
disease which is caused or contributed to by aberrant 
expression or accivity of a polypeptide of the invention 
can be identified by, for example, any or a combination 
of diagnostic or prognostic assays as described herein. 

2 5 Administration of a prophylactic agent can occur prior to 
the manifestation of symptoms characteristic of the 
aberrancy, such that a disease or disorder is prevented 
or, alternatively, delayed in its progression. Depending 
on the type of aberrancy, for example, an agonist or 

30 antagonist agent can be used for treating the subject. 
The appropriate agent can be determined based on 
screening assays described herein. 
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2 . Therapeutic Methods 

Another aspecc of the invention pertains to methods of 
modulating expression or activity of a polypeptide of the 
invention for therapeutic purposes. The modulatory 
5 method of the invention involves contacting a cell with 
an agent that modulates one or more of the activities of 
the polypeptide. An agent that modulates activity can be 
an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of the 
10 polypeptide, a peptide, a peptidomimetic , or other small 
molecule. In one embodiment, the agent stimulates one or 
more of the biological activities of the polypeptide. 
Examples of such stimulatory agents include the active 
polypeptide of the invention and a nucleic acid molecule 
15 encoding the polypeptide of the invention that has been 
introduced into the cell. In another embodiment, the 
agent inhibits one or more of the biological activities 
of the polypeptide of the invention. Examples of such 
inhibitory agents include antisense nucleic acid 
20 molecules and antibodies. These modulatory methods can 
be performed in vitro (e.g., by culturing the cell with 
the agent) or, alternatively, in vivo (e.g, by 
administering the agent to a subject) . As such, the 
present invention provides methods of treating an 
25 individual afflicted with a disease or disorder 

characterized by aberrant expression or activity a 
polypeptide of the invention. In one embodiment, the 
method involves administering an agent (e.g., an agent 
identified by a screening assay described herein) , or 
30 combination of agents that modulates (e.g., upregulates 
or downregulates) expression or activity. In another 
embodiment, the method involves administering a 
polypeptide of the invention or a nucleic acid molecule 
of the invention as therapy to compensate for reduced or 
35 aberrant expression or activity of the polypeptide. 
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Stimulation of activity is desirable in situations in 
which activity or expression is abnormally low 
downregulated and/or in which increased activity is 
likely to have a beneficial effect. Conversely, 
5 inhibition of activity is desirable in situations in 
which activity or expression is abnormally high or 
upregulated and/or in which decreased activity is likely 
to have a beneficial effect. 

This invention is further illustrated by the following 
10 examples which should not be construed as limiting. The 
contents of all references, patents and published patent 
applications cited throughout this application are hereby 
incorporated by reference. 

Equivalents 

15 Those skilled in the art will recognize, or be able to 
ascertain using no more than routine experimentation, 
many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are 
intended to be encompassed by the following claims. 

20 What is claimed is: 



wo 00/18800 



PCT/US99/22818 



- 100 - 

1, An isolaced nucleic acid molecule selected from 
the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide 
sequence which is at least 55% identical to the 
5 nucleotide sequence of SEQ ID N0:1, 3, 4, or 6, the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number , the cDNA insert of the plasmid deposited 

with ATCC as Accession Number , or a complement 

thereof; 

10 b) a nucleic acid molecule comprising a fragment of 
at least 3 00 nucleotides of the nucleotide sequence of 
SEQ ID N0:1, 3, 4, or 6 , the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , the 

cDNA insert of the plasmid deposited with ATCC as 

15 Accession Number / or a complement thereof; 

c) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with ATCC as Accession Number 

20 , or the amino acid sequence of the cDNA insert of 

the plasmid deposited with ATCC as Accession Number 

d) a nucleic acid molecule which encodes a fragment 
of a polypeptide comprising the amino acid sequence of 

25 SEQ ID NO: 2 or 5, the polypeptide encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number _/ or the polypeptide encoded by the 

cDNA insert of the plasmid deposited with ATCC as 
Accession Number , wherein the fragment comprises 

3 0 at least 15 contiguous amino acids of SEQ ID NO: 2 or 5, 
the polypeptide encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the polypeptide encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number ; and 
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e) a nucleic acid moleccale which encodes a naturally 
occurring allelic variant of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2 or 5, the amino acid 
sequence encoded by the cDNA insert of the plasmid 

5 deposited with ATCC as Accession Number , or the 

amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number , 

wherein the nucleic acid molecule hybridizes to a nucleic 
acid molecule comprising SEQ ID NO: 3 or 6 or a complement 
10 thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, 
which is selected from the group consisting of: 

a) a nucleic acid comprising the nucleotide sequence 
of SEQ ID N0:1, 3, 4, or 5 , the cDNA insert of the 

15 plasmid deposited with ATCC as Accession Number , 

the cDNA insert of the plasmid deposited with ATCC as 
Accession Number , or a complement thereof; and 

b) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 

2 0 NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number ^ , or the amino acid sequence encoded 

by the cDNA insert of the plasmid deposited with ATCC as 
Accession Number . 

25 3. The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4 . The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 
polypeptide . 

30 5. A host cell which contains the nucleic acid 
molecule of claim 1 . 
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6. The host cell of claim 5 which is a mammalian host 
cell. 

7. A non-human mammalian host cell containing the 
nucleic acid molecule of claim 1. 

5 8 . An isolated polypeptide selected from the group 
consisting of: 

a) a fragment of a polypeptide comprising the amino 
acid sequence of SEQ ID NO: 2 or 5, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 

10 NO: 2 or 5; 

b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

15 Number , or the amino" acid sequence encoded by 

the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID N0:1 or 4 or a 

2 0 complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid 
molecule comprising a nucleotide sequence which is at 
least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO: 1 or 4 or a complement 

25 thereof. 

9. The isolated polypeptide of claim 8 comprising the 
amino acid sequence of SEQ ID NO: 2 or 5, the amino acid 
sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number , or 

3 0 the amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number . 
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10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

11 . An antibody which selectively binds to a 
polypeptide of claim 8. 

5 12 . A method for producing a polypeptide selected from 
the group consisting of: 

a) a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2 or 5, the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 

10 Accession Number , or the amino acid sequence 

encoded by the cDNA insert of the plasmid deposited with 
ATCC as Accession Number ; 

b) a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 2 or 5, the amino acid 

15 sequence encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the amino acid sequence encoded by the cDNA insert of the 

o 

plasmid deposited with ATCC as Accession Number , 

wherein the fragment comprises e.t least 15 contiguous 

20 amino acids of SEQ I^ NO: 2 or 5, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with 

ATCC as Accession Number or the amino acid 

sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number ; and 

25 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number or the amino acid sequence encoded by 

3 0 the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
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nucleic acid molecule comprising SEQ ID NO: 1 or 4 or a 
complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under 
conditions in which the nucleic acid molecule is 
5 expressed. 

13 . A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 

10 b) determining whether the compound binds to the 
polypeptide in the sample. 

14 . The method of claim 13 , wherein the compound which 
binds to the polypeptide is an antibody. 

15. A kit comprising a compound which selectively 
15 binds to a polypeptide of claim 8 and instructions for 

use . 

16. A method for detecting the presence of a nucleic 
acid molecule of claim 1 in a sample, comprising the 
steps of : 

2 0 a) contacting the sample with a nucleic acid probe or 
primer which selectively hybridizes to the nucleic acid 
molecule ; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

2 5 17. The method of claim 16, wherein the sample 

comprises mRNA molecules and is contacted with a nucleic 
acid probe. 
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18. A kit corrprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 



19. A method for identifying a compound which binds to 
5 a polypeptide of claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a 
polypeptide of claim 8 with a test compound; and 

b) determining whether the polypeptide binds to the 
test c ompound , 

10 20. The method of claim 19, wherein the binding of the 

test compound to the polypeptide is detected by a method 

selected from the group consisting of: 

a) detection of binding by direct detecting of test 

compound/polypeptide binding; 
15 b) detection of binding using a competition binding 

assay; 

c) detection of binding using an assay for TANGO 191 
or TANGO 195-mediated signal transduction. 

21. A method for modulating the activity of a 
20 polypeptide of claim 8 comprising contacting a 

polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 
sufficient concentration to modulate the activity of the 
polypeptide . 

25 22. A method for identifying a ^compound which 
modulates the activity of a polypeptide of claim 8, 
comprising : 

a) contacting a polypeptide of claim 8 with a test 
compound ; and 
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b) determining the effect of the test compound on the 
activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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: GCGTCCGGATGTTTTCACTTTTGGGACATCCTGTTCTGAGTCAAGATTCCTCCTTCTGAA 

51 CATGGGACTTTCCAGAAGGACCACAGCTCC7CCCGTGCATCCACTCGGCCTGGGAGGTTC 

121 TGGAT7TTGGCTGTCGAGGGAGTTTGCCTGCCTCTCC AGAGAAAGATGGTCATGAGGCCC 
'I M V M R P 

1 S 1 CTGTGGAGTCTGCTTCTCTGGGAAGCCCTACTTCCCATT ACAGTTACTGGTGCCCAAGTG 
'"SLWSLLLWEALLPITVTGAQV 

2 4 1 CTGAGCAAAGTCGGGGGCTCGGTGCTGCTGGTGGCAGCGCGTCCCCCTGGCTTCCAAGTC 
26LSKVGGSVLLVAARPPGFQV 

3 0 1 CGTGAGGCTATCTGGCGATCTCTCTGGCCTTCAGAAGAGCTCCTGGCCACGTTTTTCCGA 
46REAIWRSLWPSEELLATFFR 

3 6 1 GGCTCCCTGGAGACTCTGTACCATTCCCGCTTCCTGGGCCGAGCCCAGCTACACAGCAAC 
66GSLETLYHSRFLGRAQLHSN 

421 CTC AGCCTGGAGCTCGGGCCGCTGGAGTCTGGAGACAGCGGCAACTTCTCCGTGTTGATG 
86-*SlELGPL£SGDSGNFSVLM 

481 GTGGACACAAGGGGCCAGCCCTGGACCCAGACCCTCCAGCTCAAGGTGTACGATGCAGTG 
106VDTRGQPWTQT LQLKVYDAV 

541 CCCAGGCCCGTGGTACAAGTGTTC ATTGCTGTAGAAAGGGATGCTCAGCCCTCCAAGACC 
126PR-PVVQVFIAVERDAQPSKT 

601 TGCCAGGTTTTCTTGTCCTGTTGGGCCCCCAACATC AGCGAAATAACCTATAGCTGGCGA 
146CQVFLSCWAPNISEITySWR 

661 CGGGAGACAACCATGGACTTTGGTATGGAACCACACAGCCTCTTCACAGACGGACAGGTG 
166RETTMDFGMEPHSLFTDGQV 

12' c'-GAGCATTTCCCTGGGACCAGGAGACAGAGATGTGGCCTATTCCTGCATTGTCTCCAAC 
laei'siSLGPGDR DVAYSC IVSN 

7 8 1 CCTGTCAGCTGGGACTTGGCCACAGTC ACGCCCTGGGATAGCTGTCATCATGAGGCAGCA 
206 PVSWDLATVTPWDSCHHEAA 

8 4 1 CCAGGGAAGGCCTCCTACAAAGATGTGCTGCTGGTGGTGGTGCCTGTCTCGCTGCTCCTG 
226 PGKASYKDVLLVVVPVSL.LL 

9 0 1 ATGCTGGTTACTCTCTTCTCTGCCTGGCACTGGTGCCCCTGCTCAGGGCCCCACCTCAGA 
246 ilLVT LFSAWHWC PCSG PH!:.R 

961 TCA.SAGCAGCTCTGGATGAGATGGGACCTGCAGCTCTCCCTCCCCAAGGTCACTCTTAGC 
266 SKQLWMRWDLQLSLPKVTLS 

1021 ij^cCTC ATTTCGACAGTGGTTTGTAGCGTGGTGC ACCAGGGCCTTGTTGAACAGATCCAC 
286 M LIST VVCSVVHQGLVEQIH 

1081 ACGTGCTCTAATAAAGTTCCCA 

306 7 C 3 >r K 7 P 
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HHEAAPGKASYKDVLLWVPVSLLLMLVTL - FSAWHWCPCSGPHLRSKQLWMRWD LQLSL 

RTDPSETKPWAVYAGLLGGVIMILIMWILQLRRRGKTNHYQTTVEKKS^ 
230 240 2S0 260 270 280 

280 290 300 310 

P - KVTLSNLISTWCSWHQGLVEQIHTC - - - SNKVPX 

PLQKKIJDSFPAQDPCTTIYVAATEPVPESVQETNSITVYASVTLPES 
290 300 310 320 330 
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Score: 7.10 Seq: 71 128 Model: 6 47 

•r.mmVsfhPcdYtlwWY. . . rNcOPi ZLtlnsViq 

7A)0&O 71 :,PFMGSOTLSDVQ--V^^fOQFSNGDPL£I:I?JCSVPKIICDKCTlHFLTPG :17 

yEDsGtVwCmV 

118 VNNSGSYICRP 128 



Score: 12.19 Seq: 168 223 Model; 1 47 

•GqsVTLTCmVs . . fhPodYt . IwWYrNooDX iLt InsWcvEDs 

tOi 16S GSTCSlSCPSLSCQSDAOSPAVTWYKNGKLLSVERSNRIVVDEVyDYHQ 216 

TAWGO 

GtYwCaV 
217 GTYVCDY 223 



score- 4.61 seq: 266 339 Model: 1 47 

TAUGa 1^1 CKPLTISCKAR5GFERVFMPVIKV/YIKDSDLrW£VSVPE.yCSIKSTlKD 3L4 

C LC InsWovEDs . G t YwCa V 

315 EIIEPNIILEKVTQRDUUUCFVCFV 339 
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input file T195Atmue9fll; Output File T195Atmue9f U • pat 
Sequence length 1603 

MWSLWSLLL 9 
CCCACGCGTCCGCCCACGCGTCCGCAAGGGAGGACAACGGCC ATG TGG TCC CTC TGG AGT CTT CTT CTC 6 9 

pVVVVSVQVLSKVGD 29 
TTT GAA GCT CTC CTT CCC GTT GTG GTT GTC AGT GTC CAA GTG CTA AGC AAG GTA GGG GAC 12 9 

-ELLVAECPPGFQVREAIWR 49 
TCA GAG CTG CTG GTG GCC GAG TGT CCT CCG GGC TTC CAA GTG CGT GAG GCT ATC TGG CGA 18 9 

SLWPSEELLATFFRGSLETL 69 
TCT CTG TGG CCA TCG GAG GAG CTC CTG GCC ACA TTT TTC CGA GGT TCC TTG GAG ACT CTG 24 9 

oFLGRVQLYDNLSLELG 89 
tIc CAC TCT CGT TTC CTG GGC CGA GTC CAG CTA TAT GAC AAC CTC AGC CTG GAG CTT GGA 309 
„.KPGDSaNFSV LMVDTRGQ109 
CCC CTG AAA CCT GGA GAC AGC GGC AAT TTC TCT GTG CTG ATG GTG GAT ACA AGO GGT CAA 369 
OTLyLKVYDAVPKPEVQ129 
ACC TGG ACC CAG ACC CTG TAT CTC AAG GTG TAC GAT GCA GTA CCC AAG CCC GAG GTT CAA 42 9 
VFTAAAEETQPL NTCQVFLS149 
GTC T?C ACT GCT GCA GCA GAG GAG ACC CAA CCC CTC AAT ACC TGT CAG GTC TTC TTG TCC 489 
pNlSDlTyS WRREGTVD169 
TGC T^G OCC CCC AAC ATC ACT GAC ATA ACC TAC AGC TGG CGA CGG GAG GGG ACA GTG GAC 54 9 
,„-EVHSHFs\gQVLSVSLG189 
T^C A^T GGT oIa olo CAC AGC CAT TTC TCA AAT GGA CAG GTG TTA ACT GTC TCA CTG GGA 609 
, ^nKDVAFTCIASNPVSWDM209 
CTC GGG GAC A^G GAT GTC GCC TTT ACC TGC ATT GCC TCC AAT CCT GTC AGC TCG GAT ATC 669 

ACC ACA GTC ACC CCC TGG GAG AGC TGC cIt clc GAG GCA GCC TCC GGG A^G GCC T^C TAC 729 
.nul.LVVVPlTLFLILAGLF249 
^0 GAC GTC C^A C^G GTA GTA GTC CCA ATT ACA CTC TTC CTC ATC CTC GCT GCT CTC TTT 789 
MUHGLCSGKKKDACTDGV269 
CGG GCA T^G CAC CAT GGC CTC TGC TCA GGG AAG AAG AAG GAT GCT TGC ACT GAC GGG GTC 849 

L P E T E N A L V • 
CTT CCA GAG ACA GAG AAT GCC CTC GTA TAG 

AGGATCTCATGAGGGACAACATAAACTGGTCCTTGGACCATGATGAGATCCCCTGCTCACCCACGTGATCCTCCACACC 958 
TGCACACCTCCAGGATCCTCATAACTGCCACAACTCGCCCTGCCTCTAGCGCACAGCCAAGAAAACCACCATCCTGGAA 1037 
CGTCACCCTCGCCCAAACTTCCTCCTTCCTCCATCCTGTTCGCACATCCCGGATCCTCTCTGGGCAAGGTGAACTAGTA 1116 
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GGATGCTTCC7TCAGAACACAGGACTTTCTCTAGGATCCACAGAGACATTGATTATCCAAGGCATCCATTCTTCTATCA 1195 

CTGTACATAAGGTCTTGCCCAACAGCCACCAAGGGACGGCCTCCAGGCC AGGACCTTGGCTCAAAGAGAGATGAGATGT 12 74 

TTGAACTAACATGGAAATTGAGCTAACCATTGCCAACTCCAACTCCAGCCCCTGGGGTCTGAGTTCCTTGTGTTCAAGA 13 5 3 

TGTTATTATAAGAAAAGGCAAAGAACAGGAAATGATGGAGGGTGGGGCATTCTCTTTCTGGTCTGAAGGACTTTAAGAT 14 3 2 

TATCTGAGTTCAAGGGCCATCAAAAGTAAATTGAGATTACAGATGATG AGGGGTTGGTAGCTAATGTGCCATGTTGGGA 1511 

TCAAAGCCATTTTTCTGGTAGCACTATATTAATAGACACCTTTGrrGCCATTTAAAAAAAAAAAAAAAAAAAAAAAAA^ 15 90 

AAAAAAAAAAAAA 16 03 
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input file T195AChpb93fl; Output File T19SAthpb93f 1 .pat 

Sequence length 1118 

GATGTTTTCACTTTTGGGACATCCTGTTCTGAGTCAAGATTCCTCCTTCTGAACATGGGACTTTCCAGAAGGACCACAG 79 

CTCCTCCCGTGCATCCACTCGGCCTGGGAGGTTCTGGATTTTGGCTGTCGAGGGAGTTTGCCTGCCTCTCCAGAGAAAG 1 5 8 

„V„RPLWSLLLWEALLFITV 20 
A?G GTC a!g AGG CCC CTG TGG AGT CTG CTT CTC TGG GAA GCC CTA CTT CCC ATT ACA GTT 218 
TC^AOVLSKVGGSVLLVAARP 40 
ACT GGT GCC CAA GTG CTG AGC AAA GTC GGG GGC TCG C-TG CTG CTG GTG GCA GCG CGT CCC 278 
orFOVREAIWRSLWPSEELL 60 
CCT GGC T?C CAA GTC CGT GAG GCT ATC TGG CGA TCT CTC TGG CCT TCA GAA GAG CTC CTG 338 
^TFFRGSLETLYHSRFLGRA 80 
GCC aL t?T T^C CGA GGC TCC CTG GAG ACT CTG TAC CAT TCC CGC TTC CTG GGC CGA GCC 3 98 
nt.HSNLSLELGPLESGDSGN100 
CAG C^A CAC AGC AAC CTC AGC CTG GAG CTC GGG CCG CTG GAG TCT GGA GAC AGC GGC AAC 458 
. „i MVDTRGQPWTOTLCLK120 
T^C T?C G?G tJg a?G GTG GAC ACA AGG GGC CAG CCC TGG ACC CAG ACC CTC CAG CTC AAG 518 

PRPVVQVFIAVERDA140 
G^G tIc gat oca GTG CCC AGG CCC GTG GTA CAA GTG TTC ATT GCT GTA GAA AGG GAT GCT 578 
«n = KTCQVFLSCWAPNISEI 160 
CAG CCC TCC aIg ACC TGC CAG GTT TTC TTG TCC TGT TGG GCC CCC AAC ATC AGC GAA ATA 638 
. UBBETTMDFGMEPHSLF180 
ACC tIt AGC TGG CGA clo GAG ACA ACC ATG GAC TTT GGT ATG GAA CCA CAC AGC CTC TTC 698 
V L S I S L G P G D R D V A y S 200 

} GGA CCA GGA GAC AGA G 

p W D S C 220 



ACA GAC GGA CAG G^G C^G AGC A^T TCC C^G GGA CCA GGA GAC AGA GAT GTG GCC TAT TCC 758 

TGC A^T G^C TCC A^C CCT G^C AGC TGG GAC T^G GCC ACA G^C ACG CCC TGG GAT AGC TGT 818 

KASyKDVLLVVVP240 
CAT CAT ola G^ GCA CCA clo AAG GCC TCC TAC AAA GAT GTG CTG CTG GTG GTG GTG CCT 878 
, , MLVTLFSAWHWCPCS260 
olc rlo C^G C^C C^G A^G CTG GTT ACT CTC TTC TCT GCC TGG CAC TGG TGC CCC TGC TCA 938 
. p „ L R S K Q L W M R M D L Q _L S JU^^H^ 2B0 
^ A AAG CAG CTC TGG ATG AGA TGG GAC CTG CAG ' 

H Q G L 300 



GGG CCC CAC CTC AGA TCA A^G CAG CTC TGG ATG AGA TGG GAC CTG CAG CTC TCC CTC CAC 998 

GTG ACT C^T AGC aIc CTC A^T TCG ACA G^G G^T TGT AGC G^G GTG CAC CAG GGC CTT 1058 

V s 0 I H T A L I K F P S L M K K K K K 320 

GTT GAA CAG ATC CAC ACT GCT CTA ATA AAG TTC CCA TCC TTA ATG AAA AAA AAA AAA AAA 1118 
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Input file T195AthLal70f 10; Output File T195AthLal70f 10 .pat 
Sequence length 28 94 

MVMRPLWSLLLWE 13 
CCCACGCGTCCGCTCCAGAGAAAG ATG GTC ATG AGG CCC CTG TGG AGT CTG CTT CTC TGG GAA 63 

ALLPITVTGAQVLSKVGGSV 33 
GCC CTA CTT CCC ATT ACA GTT ACT GGT CCC CAA GTG CTG AGC AAA GTC GGG GGC TCG GTG 123 

LLVAARPPGFQVREAIWRSL 53 
CTG CTG GTG GCA GCG CGT CCC CCT GGC TTC CAA GTC CGT GAG GCT ATC TGG CGA TCT CTC 183 

WPSEELLATFFRGSLETLYH 73 
TGG CCT TCA GAA GAG CTC CTG GCC ACG TTT TTC CGA GGC TCC CTG GAG ACT CTG TAC CAT 24 3 

SRFLGRAOLHSNLSLELGPL 93 
TCC CGC TTC CTG GGC CGA GCC CAG CTA CAC AGC AAC CTC AGC CTG GAG CTC GGG CCG CTG 303 

ESGDSSNFSVLMVDTRGQPW113 
GAG TCT GGA GAC AGC AGC AAC TTC TCC GTG TTG ATG GTG GAC ACA AGG GGC CAG CCC TGG 363 

TQTLQ LKVYDA VPR, PVVQVF 133 
ACC CAG ACC CTC CAG CTC AAG GTG TAC GAT GCA GTG CCC AGG CCC GTG GTA CAA GTG TTC 423 

IAVERDAQPSKTCQVFLSCW153 
ATT GCT GTA GAA AGG GAT GCT CAG CCC TCC AAG ACC TGC CAG GTT TTC TTG TCC TGT TGG 483 

APNlSElTySWRRETTMDFG173 
GCC CCC AAC ATC AGC GAA ATA ACC TAT AGC TGG CGA CGG GAG ACA ACC ATG GAC TTT GGT 54 3 

MEPHSLFTDGQVLSISLGPG193 
ATG GAA CCA CAC AGC CTC TTC ACA GAC GGA CAG GTG CTG AGC ATT TCC CTG GGA CCA GGA 60 3 

DRDVAYSCIVSNPVSWDLAT213 
GAC AG A GAT GTG GCC TAT TCC TGC ATT GTC TCC AAC CCT GTC AGC TGG GAC TTG GCC ACA 66 3 

VTPWDSCHHEAAPGKASyKD233 
GTC ACG CCC TGG GAT AGC TGT CAT CAT GAG GCA GCA CCA GGG AAG GCC TCC TAC AAA GAT 72 3 

yLLVVVPV .SLLLMLVTLFSA253 
GTG CTG CTG GTG GTG GTG CCT GTC TCG CTG CTC CTG ATG CTG GTT ACT CTC TTC TCT GCC 78 3 

WHWCPCSGKKKKDVHADRVG273 
TGG CAC TGG TGC CCC TGC TCA GGG AAA AAG AAA AAG GAT GTC CAT GCT GAC AGA GTG GGT 84 3 

PETENPLVQDLP* 286 
CCA GAG ACA GAG AAC CCC CTT GTG CAG GAT CTG CCA TAA 882 

AGGACAATATGAACTGATGCCTGGACTATCAGTAACCCCACTGCACAGGCACACGATGCTCTGGGACATAACTGGTGCC 961 
TGG AAATCACCATGGTCCTCATATCTCCCATGGG AATCCTGTCCTGCCTCGAAGGAGCAGCCTGGGCAGCCATCACACC 104 0 
ACGACGACAGGAAGCACCAGCACGTTTCACACCTCCCCCTTCCCTCTCCCATCTTCTCATATCCTGGCTCTTCTCTGGC 1119 



r 
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CAAGATGAGCCAAGCAGAACATTCCATCCAGGACACTGGAAGTTCTCCAGGATCCAGATCCATGGGGACATTAATAGTC 1198 

CAAGGCATTCCCTCCCCCACCACTATTCATAAAGTATTAACCAACTGGCACCAAGGAATTGCCTCCAGCCTGAGTCCTA 1277 

GGCTCTAAAJ.GATATTACATATTTGAACTAATAGAGGAACTCTGAGTCACCCATGCCAGCATCAGCTTCAGCCCCAGAC 1336 

CCTGCAGTTTGAGATCTGATGCTTCCTGAGGGCCAAGGCATTGCTGTAAGAAAAGGTCTAGAAATAGGTGAAAGTGAGA 143 = 

GGTGGGGGACAGGGGTTTCTCTTTCTGGCCTAAGGACTTTCAGGTAATCAGAGTTCATCGGCCCTCAAAGGTAAATTGC 1514 

AGTTGTAGACACCGAGGATGGTTGACAACCCATGGTTGAGATGGGCACCGTTTTGCAGGAAACACCATATTAATAGACA 1593 

TCCTCACCATCTCCATCCGCTCTCACGCCTCCTGCAGGATCTGGGAGTGAGGGTGGAGAGTCTTTCCTCACGCTCCAGC 1672 

ACAGTGGCCAGGAAAAGAAATACTGAATTTGCCCCAGCCAACAGGACGTTCTTGCACAACTTCAAGA^^ 1751 

CTCAGGATGAGTCTTCCTGCCTGAAACTGAGAGAGTGAAGAACCATAAAACGCTATGCAGAAGGAACATTAT^^^^ 1830 

AAGGGTACTGAGGCACTCTAGAATCTGCCACATTCATTTTCAAATGCAAATGCAGAAGACTTAC^^^ 1509 

ggggacaaagaccccacagcccaacagcaggactgtagaggtcactctgactccatcaaactt™ 1998 

TAGGAAAATACATTCTGCCCCTGAATGATTCTGTCTAGAAAAGCTCTGGAGTATTGATCACT^^^ 2067 

GGAGCTAAACTTACCrrCGGGGATTATTAGCTGATAAGGTTCACAGTTTCTCTCACCCAGGTGTAAC^^ 2146 

GGGGCCTCAATCCAGTCTTGATAACAGCGAGGAAAGAGGTATTGAAGAAACAGGGGTGGGTTTGAAGTAC™ 2225 

AGGGTGGCTTCAATCTCCCCACCTAGGATGTCAGCCCTGTCCAAGGACCTTCCCTCTT^^^ 2304 

ACrrCACCTTGGACAAAGGATCAGCACAGCTGGCCTCCAGATCCACATCACCACTC^^ "83 

CTCCCTGCCTGGCCTGCTCAGAGGTTCCCTGTTGGTAACCTGGCTTTATCAAATTCTCATCC^^^^ 2462 

GCCACCTACAGTCAGGCCACATGCCTGGTCACTGAATCATGCAAAACTGGCCTCAGTCCC^ 2620 

;^GCCCACGATCTGACAATGAGCCCTGGTGGATTTGTGGGGAAAAAATAC^^^^^^ ^^^^ 

TCTCCAGGGCCCCACCTCAGATCAAAGCAGCTCTGGATGAGATGGGACCTGCAGCTCTCCCTCCA^^^^ 277B 

GCAACCrCATTrCGACAGTGGTrTGTAGCCTGGTGCACCAGGGCCTTGTT.3AACAGATCCACACTGCTC^^^ 2857 

CCCATCCTTAATGAAAAAAAAAAAAAAAAAAAAAAAA 
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